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METHODS OF SURVEYING FOR CC (BETA) CHEMOKINE 
RECEPTOR VARIANTS AND THEIR ASSOCIATION WITH HIV-l 
TRANSMISSION AND/OR DISEASE PROGRESSION 



BACKGROUND OF THE INVENTION 

Cross-^referftnee fn RpJared Application 
10 This non-provisional patent application claims benefit 

of provisional patent application U.S. Serial number 60/148,530 
filed August 12, 1999, now abandoned. 

Fftdftral Funding Legend 

15 This invention was produced in part using funds 

obtained through grants AI41951, AI40591 and AI42454 from 
NIAID, and grant DA04347 from NIDA. Consequently, the federal 
government has certain rights in this invention. 

20 Field of the Invention 

The present invention relates generally to geno typing 
CCR alleles in individuals and populations. More specifically, the 
present invention relates to methods of correlating CCR alleles 
and/or genotypes, specifically CCR5 promoter alleles, with HIV-1 
25 transmission and/or disease progression. 
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Descripfion of thft Related Art 

"Chemokine" describes a closely related family of 
'c/iemotactic cytokines' with conserved sequences, known to be 
potent attractors for various leukocyte subsets such as 
5 neutrophils, monocytes, or lymphocytes. Chemokines are a large 
superfamily consisting of four subfamilies that display between 
two and four highly conserved NH2-terminal cysteine amino acid 
residues. The CXC (or a) family has the first two NHj-terminal 
cysteines separated by one nonconserved amino acid residue. In 

10 contrast, the CC (or P) family has these cysteines in juxtaposition, 
and the C (or y) family has one lone NHj-terminal cysteine residue, 
while the CX3C (or 5) family has these cysteines separated by three 
intervening amino acids. The large number of chemokines and 
their receptors, together with the expression of chemokine 

15 receptors on cells other than leukocytes (such as epithelial, 
endothelial and smooth muscle cells) is indicative of the 
importance of these molecules. 

Six receptors for the CX: family of chemokines have 
been identified: CC-CCRl, CC-CCR2, CC-CCR3, CC-CCR4, CC-CCRS 

20 and the duffy blood group antigen. These proteins are seven 
transmembrane domain G protein-coupled receptors. In general, 
there is broad overlap in the ligands bound by the CC chemokine 
receptors. The human CCR5 and CCR2 chemokine receptor genes, 
which serve as co-receptors with CD4 for HIV, are tightly linked on 

25 chromosome 3p21-22, separated by 20 kb. 

The complex mechanisms for HIV-1 entry into human 
CD4* cells reflect in part its evolving usage of co-receptors (35). 
For example, macrophage-tropic HIV-1 isolates infect cells by 
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binding to the CD4* receptor and the CCR5 co-receptor (36), while 
T-cell-tropic viruses mainly use CXCR4 as the co-receptor (37). 
HIV-1 represents a close relative of simian immunodeficiency virus 
(SIV) that is naturally found in African primates, including 
5 chimpanzees and sooty mangabeys (15). Recent studies have 
confirmed that HIV-1 originated in the chimpanzee subspecies Pan 
troglodytes troglodytes from Central West Africa (16). Both HIV 
and SIV prefer CCR5 as their co-receptor for penetrating CD4+ cells 
(17-19), which suggests that CCR5 is commonly expressed in both 

10 human and non-human primates. Therefore, highly conserved 
transcription factor binding elements may represent a critical 
mechanism for regulating transcription of the CCR5 gene, and may 
serve as appropriate targets for intervention experiments. 
Additionally, other members of the CCR co-receptor gene family 

IS also facilitate viral transmission (38), but they appear to have 
more restricted cellular distribution (19,39-40). 

Recent work in AIDS cohorts has revealed associations 
between several polymorphisms in the CC (beta) chemokine 
receptor loci and either variable degrees of protection against HIV- 

20 1 transmission or variable evolution of the AIDS that follows (1-4). 
In Caucasians, homozygosity for a 32-bp deletion (A32) mutation 
in the CCR5 coding sequence leads to the absence of cell surface 
expression of CCR5, which confers nearly complete resistance to 
HIV-1 infection (1,41-44), Another less frequent mutation at 

25 nucleotide position 303 of the CCR5 coding sequence (m303) 
introduces a premature stop codon that also abolishes surface 
expression of CCR5 in Caucasians. Accordingly, an individual 
carrying both CCR5-A32 and m303 is resistant to HIV-1 infection 
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(45). Carriage of a single copy of CCRS- A32 in the presence of 
CCR5 wild-type provides little if any resistance to HIV-1 infection, 
but does tend to delay the progression of disease (1,4,46), 
possibly through interference with translocation of the wild-type 
5 product (47). Neither the CC/J5- A32 nor the CC/?5-m303 allele is 
frequent in African and Asian populations (1,43,48). However, 
African and Asian ethnic groups also show great variability in 
acquisition and progression of the disease. Population-specific 
polymorphisms within the CCi?J gene may exist (49), but none of 

10 these have been associated with varying degrees of viral 
susceptibility (48). The use of CCR5 as the co-receptor is not 
altered whether or not additional CCR5 polymorphisms are 
present, regardless of the HIV-1 subtype (50). 

In addition, extensive heterogeneities have been 

15 described in the expression and splicing of CCR5, apparently 
regulated by different CCR5 promoter alleles (3). More 
importantly, 2 CCR5 promoter variants have been associated with 
contrasting rates of HIV-1 disease progression (5,6). Expression of 
cell surface CCR5 also varies widely, even in individuals 

20 homozygous for wild type CCR5 (51,52), indicating differential 
regulation of CCR5 production. Therefore, variability in infectivity 
of primary HIV-l isolates that prefer CCR5 as the co-receptor is 
observed even in wild type individuals. Detailed study of the 
organization of the CCR5 gene and its promoter has uncovered 

25 extensive heterogeneity in the 5' untranslated region (UTR) of 
CCR5 mRNA (13). Some of these polymorphisms may be linked to 
changes in splicing and may account for continuing uncertainties 
about the intron and exon boundaries (13-14,53). One CCR5 
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promoter variant (59653T) shows strong linkage disequilibrium 
with CCR2b'641 (2) - a CXIR variant previously associated with 
retarded disease progression (4), and subsequent work has further 
detailed the relationships of CCR5 promoter markers to both 
5 CCR2b and CCR5-A32 (3,5,6). More recent association of the 
59029G/G genotype with reduced promoter activity (5) also 
suggests that CCR5 promoter polymorphisms independently 
modulate HIV-1 disease progression. 

The mechanisms and credibility of these specific 

10 genetic associations are under debate (3,10), especially because 
some of the C(ZR variants are tightly linked to each other (2). 
However, these findings highlight the importance of host genetic 
factors in HIV/AIDS and may guide development of new measuit > 
for the prevention and control of HIV-1 -related diseases (7-9). 

15 Polymorphisms at the CCRlb, CCR5, and CCR5 promoter 

loci have been analyzed in four ethnic groups, with a special 
emphasis on the relationships of CC/?5 promoter allelic variants to 
other well-characterized markers previously associated with 
different outcomes of HIV-1 transmission and disease progression. 

20 In addition, the same CCR5 promoter polymorphisms were 
analyzed in HIV-1 -infected Rwandan women, with a special 
emphasis on the influence of the genetic polymorphisms on HIV-1 
disease progression. 

The prior art is deficient in methods used to predict 

25 the likelihood and/or probability of HIV-l transmission and/or 
disease progression based upon CCR alleles and/or genotypes. The 
present invention fulfills a long-standing need for the development 
of a rapid and informative genotyping strategy that can be readily 
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applied to analyze CCR2, CCR5 and related genetic variants and to 

evalulate the relationship of each genotype to HIV transmission 
and disease progression. 



5 SUMMARY OF THE INVENTION 

Variability in HIV-1 infection has been associated with 
genetic variants in the beta-chemokine receptor 5 (CCR5) locus. 
Genetic variations (a 32-bp deletion and a point mutation) in the 

10 coding sequence of the HIV-1 co-receptor, CC/?5, have been shown 
to confer resistance to HIV-1 infection by depleting CCR5 
expression on the cell surface. While CC/?5 coding sequences have 
exhibited relatively limited variation, its promoter sequence 
appears polymorphic in all major populations. 

15 The studies reported herein revealed five major CCR5 

promoter alleles with distributions that differed widely among the 
four distinct ethnic groups examined. Herein, the methods of the 
instant invention are used to present evidence that particular 
genetic variants of the CCR5 promoter appear to determine the 

20 infectability of an individual {i.e., herein, of heterosexual women 
in Kigali, Rv^anda, and injecting drug users in New York, USA). 
Additionally, the present methods have been used to examine the 
relationship between the major CCR5 promoter genotypes and HIV- 
1 to AIDS disease progression {i.e., among 201 HIV-1 -infected 

25 Rwandan women). The effects of disease progression-related 
CCR2b and CCR5 polymorphisms on early HIV-1 viral load were 
also determined in a cohort of homosexual HIV-1 seroconverters. 
The methods of the present invention allowed the inventors to 
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establish an independent and strong linkage between HIV-1 
transmission and CCR5 promoter alleles using these distinct 
cohorts and further, to validate the methodology of the present 
invention. 

One object of the present invention is to provide 
methods of correlating CXR alleles and/or genotypes, specifically 
CCR5 promoter alleles, with HIV-1 transmission and/or disease 
progression. 

In an embodiment of the present invention, there is 
provided a method of surveying CCR genotypes in a population, 
comprising the steps of: (a) obtaining biological samples from a 
representative number of individuals in a population, with each 
sample being from a different individual, wherein the sample 
contains genomic DNA; (b) combining a portion of each sample 
15 with at least one experimental primer combination and a control 
primer combination, thereby producing primer-annealed DNA, 
wherein the experimental primer combinations are selected from 
the group consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 
9. 5 & 10, 5 & 11, 5 & 12. 5 i& 13, 6 & 7, 6 & 8. 6 & 12, 6 & 13. 14 
20 & 16 and 15 & 16, wherein the control primer combination is SEQ 
ID Nos. 17 & 18; (c) amplifying the primer-ann'^aled DNA in a 
reaction, thereby producing amplicons, wherein reaction 
conditions for the amplification are optimized for sequence- 
specific amplification, wherein each experimental primer 
25 combination and the control primer combination are predicted to 
produce one or more amplicons having expected sizes in 
basepairs; (d) separating the amplicons by size, wherein the 
presence of: a 197 bp amplicon with the control primer 
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combination is indicative of a CCR5 wildtype coding sequence; a 
165 bp amplicon with the control primer combination is indicative 
of a CCR5-A32 coding sequence; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 7, a 367 bp 
5 amplicon with experimental primer combination SEQ ID Nos. 5 & 
10, a 412 bp amplicon with experimental primer combination SEQ 
ID Nos. 5 & 12 and a 309 bp amplicon with experimental primer 
combination SEQ ID Nos. 14 & 16 is indicative of a P*0101 CCR5 
promoter allele; a 363 bp amplicon with experimental primer 
10 combination SEQ ID Nos. 5 & 7, a 367 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 10, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
13 and a 309 bp amplicon with experimeutiil primer combination 
SEQ ID Nos. 14 & 16 is indicative of a P*0102 CCR5 promoter allele; 
15 a 363 bp amplicon with experimental primer combination SEQ ID 
Nos. 5 & 7, a 363 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 9, a 367 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 11, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
20 13 and a 309 bp amplicon with experimental primer combination 
SEQ ID Nos. 14 & 16 is indicative of a P*0103 CCR5 promoter allele; 
a 363 bp amplicon with experimental primer combination SEQ ID 
Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
25 experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of a P*0201 CCR5 promoter allele; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 
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13 and a 309 bp amplicon with experimental primer combination 
SEQ ID Nos. 15 & 16 is indicative of a P*0202 CCR5 promoter allele; 
(e) determining a CCR genotype for each sample based upon the 
CCR alleles indicated following step (d); and (f) compiling the 
5 genotypes determined in step (e), thereby genotyping the 
representative number of individuals in the population, thereby 
surveying CCR genotypes in the population. 

In another embodiment of the present invention, there 
is provided a method of surveying HIV-1 co-receptor CCR alleles in 

10 an individual, comprising the steps of: (a) obtaining a biological 
sample from an individual, wherein the sample comprises genomic 
DNA; (b) combining a portion of the sample with at least one 
experimental primer combination and a control primer 
combination, thereby producing' primer-annealed DNA, wherein 

15 the experimental primer combinations are selected from the group 
consisting of SEQ ID Nos. 2&3, 2&4, 5&7, 5&8, 5&9, 5&10, 

5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 
15 & 16, wherein the control primer combination is SEQ ID Nos. 17 

6 18; (c) amplifying the primer- annealed DNA in a reaction, 
20 thereby producing amplicons, wherein reaction conditions for the 

amplification are optimized for sequence-specific amplification, 
wherein each experimental primer combination and the control 
primer combination are predicted to produce one or more 
amplicons having expected sizes in basepairs; and (d) separating 
25 the amplicons by size, wherein the presence of: a 197 bp amplicon 
with the control primer combination is indicative of a CCR5 
wildtype coding sequence; a 165 bp amplicon with the control 
primer combination is indicative of a CCR5-A32 coding sequence; a 
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363 bp amplicon with experimental primer combination SEQ ID 
Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp 
5 amplicon with experimental primer combination SEQ ID Nos. 14 & 
16 is indicative of a P*0101 CCR5 promoter allele; a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
7, a 367 bp amplicon with experimental primer combination SEQ 
ID Nos. 5 & 10, a 412 bp amplicon with experimental primer 

10 combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of a P*0102 CCR5 promoter allele; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 7, a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 

15 9, a 367 bp amplicon with experimental primer combination SEQ 
ID Nos. 5 & 11, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos, 5 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of a P*0103 CCR5 promoter allele; a 363 bp amplicon with 

20 experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 
13 and a 309 bp amplicon with experimental primer combination 
SEQ ID Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele; 
a 363 bp amplicon with experimental primer combination SEQ ID 

25 Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 15 & 16 is indicative 
of a P*0202 CCR5 promoter allele. 
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In yet another embodiment of the present invention, 
there is provided a method of predicting the disease progression 
to AIDS in an HIV- 1 -infected individual, comprising the steps of: 
(a) obtaining a biological sample from an individual, wherein the 
sample comprises genomic DNA; (b) combining a portion of the 
sample with at least one experimental primer combination and a 
control primer combination, thereby producing primer-annealed 
DNA, wherein the experimental primer combinations are selected 
from the group consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 
8, 5 & 9, 5 & 10, 5 & II, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 
13, 14 & 16 and 15 & 16, wherein the control primer combination 
is SEQ ID Nos. 17 & 18; (c) amplifying the primer-annealed DNA in 
a reaction, thereby producing amplicons, wherein reaction 
conditions for the amplification are optimized for sequence- 
15 specific amplification, wherein each experimental primer 
combination and the control primer combination are predicted to 
produce one or more amplicons having expected sizes in 
basepairs; (d) separating the amplicons by size, wherein the 
presence of: a 363 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 8, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 6 & 13 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 
16 is indicative of a CCR5 promoter genotype of P*0201/P*0201; a 
363 bp amplicon with experimental primer combination SEQ ID 
Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 13, a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 
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8, a 412 bp amplicon with experimental primer combination SEQ 
ED Nos. 6 & 13, a 309 bp amplicon with experimental primer 
combination SEQ ID Nos. 14 & 16 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 15 & 16 is indicative 
of a CCR5 promoter genotype of P*0102/P*0202; a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
7, a 367 bp amplicon with experimental primer combination SEQ 
ID Nos. 5 4&: 10, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 12, a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 
13 and a 309 bp amplicon with experimental primer combination 
SEQ ID Nos. 14 & 16 is indicative of a CCR5 promoter genotype of 
P*0101/P*0201; a 363 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 7, a 367 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 10, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
12, a 363 bp amplicon with experimental primer combination SEQ 
ID Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13, a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 and a 309 
bp amplicon with experimental primer combination SEQ ID Nos. 1 5 
& 16 is indicative of a CCR5 promoter genotype of P*0101/P*0202; 
wherein a CCR5 promoter genotype of P*0201/P*0201 or 
P*0102/P*0202 is predictive of an accelerated rate of AIDS 
progression in the indiyidual (relative to an individual who does 
not possess the P*0201/P*0201 or P*0102/P*0202 gentoype), 
wherein a CCR5 promoter genotype of P*0101/P*0201 or 
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P*0101/P*0202 is predictive of a slower rate of AIDS progression 
in the individual (relative to an individual who does not possess 
the P*0101/P*0201 or P*0101/P*0202 gentoype). 

In still yet another embodiment of the present 
5 invention, there is provided a method of predicting the probability 
of HIV-1 infection in an individual, comprising the steps of: (a) 
obtaining a biological sample from an individual, wherein the 
sample comprises genomic DNA; (b) combining a portion of the 
sample with at least one experimental primer combination and a 

10 control primer combination, thereby producing primer-annealed 
DNA, wherein the experimental primer combinations are selected 
from the group consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 
8, 5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 
13, 14 & 16 and 15 & 16, wherein the control primer combination 

15 is SEQ ID Nos. 17 & 18; (c) amplifying the primer-annealed DNA in 
a reaction, thereby producing amplicons, wherein reaction 
conditions for the amplification are optimized for sequence- 
specific amplification, wherein each experimental primer 
combination and the control primer combination are predicted to 

20 produce one or more amplicons having expected sizes in 
basepairs; and (d) separating the amplicons by size, wherein the 
presence of: a 363 bp with experimental primer combination SEQ 
ID Nos. 6 & 8, a 412 bp with experimental primer combination SEQ 
ID Nos. 6 & 13 and a 309 bp with experimental primer 

25 combination SEQ ID Nos. 14 & 16 is indicative of a CCR5 promoter 
genotype of P*0201/P*0201; a 363 bp with experimental primer 
combination SEQ ID Nos, 5 & 7, a 367 bp with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp with experimental 
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primer combination SEQ ID Nos. 5 & 12 and a 309 bp with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of a CCR5 promoter genotype of P*0101/P*0101; wherein a CCR5 
promoter genotype of P*0201/P*0201 is predictive of a decreased 
5 probability of HIV-1 infection in the individual (relative to an 
individual who does not possess the P*0201/P*0201 genotype), 
wherein a CCR5 promoter genotype of P*0101/P*0101 is predictive 
of an increased probability of HIV-1 infection in the individual 
(relative to an individual who does not possess the P*0101/P*0101 
10 genotype). 

In yet another embodiment of the present invention, 
there is provided a method of correlating CCR genotypes with HIV- 
1 transmission and/or disease progression, comprising the steps 
of: (a) obtaining biological samples from a representative number 

15 of individuals, wherein each sample is from a different individual, 
wherein the sample comprises genomic DNA; (b) assessing each 
individual's HTV-l status and/or risk of acquiring HIV-1; (c) 
assigning each individual to a risk group, wherein the assignment 
is based upon the individual's HTV-l status and/or risk of 

20 acquiring HIV-1; (d) combining a portion of each sample with at 
least one experimental primer combination and a control primer 
combination, thereby producing primer-annealed DNA, wherein 
the experimental primer combinations are selected from the group 
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 

25 5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 
15 & 16, wherein the control primer combination is SEQ ID Nos. 17 
& 18; (e) amplifying the primer-annealed DNA in a reaction, 
thereby producing amplicons, wherein reaction conditions for the 

14 
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amplification are optimized for sequence-specific amplification, 
wherein each experimental primer combination and the control 
primer combination are predicted to produce one or more 
amplicons having expected sizes in basepairs; and (f) separating 
5 the amplicons by size, wherein the presence of: a 197 bp amplicon 
with the control primer combination is indicative of a CCR5 
wildtype coding sequence; a 165 bp amplicon with the control 
primer combination is indicative of a CCR5-A32 coding sequence; a 
363 bp amplicon with experimental primer combination SEQ ID 

10 Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp 
aiMpUcon with experimental primer combination SEQ ID Nos. 14 & 
16 IS indicative of a P*0101 CCR5 promoter allele; a 363 bp 

1 5 amplicon with experimental primer combination SEQ ID Nos. 5 & 
7, a 367 bp amplicon with experimental primer combination SEQ 
ID Nos. 5 & 10, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative 

20 of a P*0102 CCR5 promoter allele; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 7, a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
9, a 367 bp amplicon with experimental primer combination SEQ 
ID Nos, 5 & 11, a 412 bp amplicon with experimental primer 

25 combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of a P*0103 CCR5 promoter allele; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp 
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amplicon with experimental primer combination SEQ ID Nos. 6 & 
13 and a 309 bp amplicon with experimental primer combination 
SEQ ID Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele; 
a 363 bp amplicon with experimental primer combination SEQ ID 
5 Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. IS & 16 is indicative 
of a P*0202 CCR5 promoter allele. 

In a further embodiment of the present invention, 
10 there is provided an oligonucleotide selected from the group 
consisting of SEQ ID Nos. 2-16. Other aspects, features, and 
advantages of the present invention will be apparent from the 
following description of the presently preferred embodiments of 
the invention. These embodiments are given for the purpose of 
15 disclosure. 

BRIEF DESCRIPTION OF THE DRAWINGS 



The appended drawings have been included herein so 
20 that the above-recited features, advantages and objects of the 
invention will become clear and can be understood in detail. 
These drawings form a part of the specification. The appended 
drawings illustrate preferred embodiments of the invention and 
should not be considered to limit the scope of the invention. 

25 Figure 1 shows a genetic map of CCR variants on 

chromosome 3 and the genotyping strategy of the present 
invention. Figure lA shows polymorphisms in the coding region 
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Of CCR2b and CCR5, known as 64V/I and CCR5A32, respectively, 
occur at nucleotide positions 46295 (based upon GenBank 
sequence U95626) and from 62036 to 62067, respectively. 
Nucleotides 59052G and 59530C enclose the region associated 
5 with basal and induced CCR5 promoter activities. Figure IB 
shows polymorphisms within and slightly beyond the boundaries 
of the CCR5 promoter. These polymorphisms define 6 allelic 
variants in these cohorts (P*0104 was only observed once in the 6 9 
sequenced samples). Single polymorphic positions (59029 and 

10 59653) and segments (boxed) in this region have been targeted in 
various studies and the corresponding alternative allele 
nomenclature is also shown (boxed). Figure IC shows that based 
upon the various features outlined above, genotyping can be 
performed utilizing sequence-specific primers (PCR-SSP-based 

15 typing) that simultaneously target all major variants in 15 different 
PGR reactions. The polarity (5'^3') and the 3' terminal sequence 
of each sequence-specific and general primer used in PCR-SSP are 
indicated by solid and open arrows, respectively. Three SSP 
reactions (2b, 2c and 3c, as indicated by broken lines) do not 

20 contribute to allele assignment but have the potential to detect 
new alleles involving the targeted polymorphic sites. The control 
product in all reactions carries the 32 bp deleted region in the 
CCR5 coding sequence. 

Figure 2 shows examples of PCR-SSP-based typing of 
25 CCR5 variants. Four samples (No. 1 to No. 4) are shown with 1 2 
selected SSP reactions (la, lb, 2a, 2d, 3a, 3b, 3d, 4a, 4b, 2e, 5a, 5 b 
(from left to right)). Arrows indicate the position of the control 
product that is present in all reactions. DNA size markers (M) 
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(New England Biolabs) are also included. Sample No. 1 has the 
genotype CCR5 wild type WT/WT, CCR2b-64V/I and CCR5 promoter 
P*0101/P*0202; Sample No. 2 has the genotype CCR5 WT/WT, 
CCR2b-64V/I and CCR5 promoter P*0102/P*0202; Sample No. 3 has 
5 the genotype CCR5 WTAVT, CCR2b.64V/I and CCR5 promoter 
P*0103/P*0202; Sample No. 4 has the genotype CCR5 WT/WT, 
CCR2b-64V/I and CCR5 promoter P*0201/P*0202. 

Figure 3 shows the allelic frequencies of CC/f2Z>-64I, 
CC R5' A32, and CCR5 promoter variants in four ethnic groups from 

10 the Bronx, New York (Caucasians, African- Americans, and 
Hispanics) and Kigali. Rwanda. The CCR2b'64l is invariably linked 
to CCR5 promoter allele P*0202 in all 4 groups, while the 
Caucasian- and Hispanic-specific CCR5-A32 is exclusively linked to 
CCR5 promoter allele P*0201 (see Tables 5 & 6). The frequency of 

15 CCR5'A32 in Caucaisan-Americans studied herein was lower than 
those reported elsewhere. 

Figure 4 shows the distribution of SDFl and CCR 
genotypes in participants from Kigali, Rwanda and the Bronx, New 
York. Genotypes that appeared to mediate HIV-1 transmission in 

20 ethnic Africans (African-Americans and Rwandans) are indicated 
by arrows. Genotypes that appeared to mediate HIV-1 
transmission in ethnic Caucasians (Caucasian- Americans and 
Hispanics) are indicated by asterisks. Genotypes showing 
associations with HIV-1 transmission in all ethnic groups are 

23 indicated by an arrow and an asterisk. 

Figure 5 shows the phylogenetic relationships among 
CC/?5 promoter sequences from humans (1-5; and see Figure 6), 
chimpanzees (6), pig-tailed macaques (7), and sooty mangabeys 
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(8). Figure 5A shows CCR5 promoter sequences from non-human 
primate species demonstrate nucleotide substitutions in 2 putative 
transcription-factor-binding (TFB) elements: SIE-like (sis/platelet- 
derived growth factor-inducible element-like) and TATA-like 
5 (TATA box-like AT-rich sequence). Figure SB shows pair-wise 
genetic distance matrix calculated using Kimura's two-parameter 
method (31). The alleles from non-human primates (GenBank 
accession numbers AF109384. API 15963, API 15964) are identical 
to human allele P*0102 at all 5 positions found to be polymorphic 

10 in humans. Figure 5C shows phenogram based on maximum 
likelihood method. Numbers next to branches indicate the 
frequencies supported by bootstrap re-analyses of the sequence 
data using both neighbor-joining method (32) in the FHYUP 
package (29) and parsimony method in the PAUP package (30). 

15 Grouping of human promoter allele P*0102 with P*0103 is not 
supported by neighbor-joining or parsimony analysis. 

Figure 6 shows the major CC/f5 promoter alleles from 
humans (1-5) and their reported and predicted relationship to 
HIV-1 disease progression. Figure 6 A shows the 5 major CCR5 

20 promoter alleles (deposited in GenBank with accession numbers 
AF109379, AF109380, AF109381. AF109382, AF109383, 
respectively) found in various human ethnic groups are defined by 
5 polymorphic sites in the region spanning nucleotide positions 
59029 to 59653 (relative :c GenBank accession number U59626). 

25 Figure 6B shows relationship between previously recognized 
CCR2b, CCR5 promoter genotypes and the CCR5 promoter alleles 
specified in (4 A). The + and - indicate rapid and slow HIV-1 
disease progression, respectively, as reported in several studies (4- 
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6). The predicted effect (+, rapid progression) of the 
P*020I/*0201 genotype can be inferred from the reported studies. 

Figure 7 shows polymorphic sites at the CCR2 and 
CCR5 loci. Nomenclature for the CCR5 upstream (P„) and 
5 downstream (Pd) sequence differs widely among studies. The 
alternative numbering of nucleotides is given in parentheses. 

Figure 8 shows PGR haplotyping of CCR5 upstream 
promoter variants using sequence-specific primers. 

Figure 9 shows nucleotides at polymorphic sites of 
10 stable CCR (CCR2-CCR5) haplotypes. 

Figure 10 shows HIV-1 viral load (VL) in people with 
different genotypes. Figure 10 A shows VL according to 58755 
(29) A/G and the linked CC/? J-A32. Figure lOB shows \L 
according to CC/?2-64V/I. VI, V2 and V3 were three different 
15 visits. 

Figure 11 shows HIV-1 viral load according to <XR 

haplotypes. 

Figure 12 shows the CCR genotypes exhibiting the 
most contrasting associations with viral load. 
20 Figure 13 shows the relationship between CCR5 

genotypes and disease progression. 

DETAILED DESCRIPTION OF THE INVENTION 

25 Variability in the natural history of HIV-1 infection has 

been associated with polymorphic genetic variants in the beta- 
chemokine receptor 5 (CCR5) locus. For instance, genetic 
variations in the HIV-1 co-receptor, CCR5, including a 3 2- bp 
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deletion and a point mutation, have been shown to confer 
resistance to HIV-l infection by depleting CC/?5 expression on the 
cell surface. The studies reported herein reveal five major CCR5 
promoter alleles with distributions that differed widely among the 
5 four distinct ethnic groups examined. Herein, evidence is 
presented that genetic variants of the CCR5 promoter also appear 
to determine the infectability of heterosexual women in Kigali, 
Rwanda, and injecting drug users in New York, USA. Additionally, 
the relationship of the major CCR5 promoter genotypes with HIV-1 

10 disease progression has been examined among 201 HlV-l-infected 
Rwandan women. The effects of disease progression-related 
CCR2b and CCR5 polyniiorphisms on early HIV-1 viral load were 
also determined in a cohort of homosexual HIV-1 seroconverters. 

In particular, promoter allele P*0103 (G59029-T59353- 

15 T59356-A59402-C59653, SEQIDNo. 21) was largely restricted to blacks. 
The promoter allele P*0202 (A59029-C59353-C59356-A59402-T59653, SEQ ID 
No. 23) was tightly linked to the slightly less frequent CCR2b'6Al, a 
variant of the CCR2b g^nt which is about 12.7 kb upstream of the 
promoter region. Another closely related promoter allele P*0201 

20 (A59029-C59353-C59356-A59402-C59653> SEQ ID No. 22) cxclusively carried 
the far less common CC/J5-A32, a 32-bp deletion in the CCR5 
coding sequence 2 kb downstream from the promoter. Among 
human, chimpanzee, pig-tailed macaque, and sooty mangabey 
promoter allelic sequences, the apparent ancestral lineage of the 

25 promoter sequence (G59029-T59353-C59356-A59402-C59653 = human 
P*0102, SEQ ID No. 20) was highly conserved across the primate 
species analyzed here, while P*0201 and P*0202 arose more 
recently than the other 3 major alleles. 

21 



BNSDOCID. <WO .0112B57A2 I > 



wo 01/12857 PCTAJSOO/22255 

Among the various CCR5 promoter genotypes 
comprising the live major alleles (P*0101, P*0102, P*0103, P*0201, 
and P*0202), the genotypes involving allele P*0201 were associated 
with protection against HIV-1 infection (OR=0.4, p<0.001 in Kigali; 
5 OR=0.5, /7=0.012 in New York), while the homozygous genotype 
P*0101/*0101 showed the opposite effect (OR=4.7, />=0.140 in 
Kigali; OR=4.1, />=0.002 in New York). According to multivariate 
logistic regression models, these observed promoter effects were 
independent of age, ethnicity and at least seven high-risk 
10 behavioral factors known to iniluence HIV-1 infection in the two 
cities: in Kigli, HIV-1 infection = 6.21 + 0.68 x (risk score) - 0.13 x 
age - 0.79 x (CCRS promoter genotypes involving P*0201); in New 
York, HIV-1 infection = 1.60 + 0.31 x (needle sharing) - 0.06 x age 
+ 1.41 X (CC/?5 promoter genotype P*0101/*0101) -0.83 x (being 
15 Caucasian). Differences in the adjusted genetic effects between 
the two cohorts reflected differences in the distribution of the two 
mutually exclusive promoter alleles involved. 

Two promoter genotypes (designated P*0201/*0201 
and P*0102/*0202) were weakly associated with accelerated HIV-1 
20 disease progression (p=0.072 and 0.058, respectively). In 
contrast, 2 promoter genotypes involving a common allele P*0101 
(P*0101/*0201 and P*0101/*0202) were collectively associated 
with slower disease progression ip=0.Q01, pc=0.015). Both 
P'=0102/*0202 and P*0101/*0202 genotypes always carried CCR2b' 
25 641, but CCR2b-64l alone had no detectable effect on disease 
progression. These findings support and refine some of the earlier 
observations regarding CCRS promoter polymorphisms and futher 
suggest that the previously recognized CCR2b-64l relationship is 

22 



eNS0OCI0:<WO 0n 2857A2 I > 



10 



" RCT/USOO/22255 

secondary to the CC/?5 promoter allelic effects. Thus, the present 
invention establishes an independent linkage between HIV-1 
transmission and the genetic variation within the CCR5 promoter 
alleles and further establishes a mechanism by which OCR 
chemokine receptor polymorphisms governs the initiation and 
pathogenesis of HIV-1 infection. 

The present invention is directed towards methods of 
correlating CXZR alleles and/or genotypes, specifically CCR5 
promoter alleles, with HIV-1 transmission and/or disease 
progression in an individual or population. 

The present invention is directed towards a method of 
surveying CCR genotypes in a population, comprising the steps of: 

(a) obtaining biological anples from a representative number of 
individuals in a population, wherein each sample is from a 

15 different individual, wherein the sample comprises genomic DNA; 

(b) combining a portion of each sample with at least one 
experimental primer combination and a control primer 
combination, thereby producing primer-annealed DNA, wherein 
the experimental primer combinations are selected from the group 
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 

5 & 11, 5 & 12. 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & !3, 14 & 16 and 
15 & 16, wherein the control primer combination is SEQ ID Nos. 1 7 

6 18; (c) amplifying the primer- annealed DNA in a reaction, 
thereby producing amplicons, wherein reaction conditions for the 

25 amplification are optimized for sequence-specific amplification, 
wherein each experimental primer combination and the control 
primer combination are predicted to produce one or more 
amplicons having expected sizes in basepairs; (d) separating the 
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amplicons by size, wherein the presence of: a 197 bp amplicon 
with the control primer combination is indicative of a CCR5 
wildtype coding sequence; a 165 bp amplicon with the control 
primer combination is indicative of a CCR5-A32 coding sequence; a 
5 363 bp amplicon with experimental primer combination SEQ ID 
Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 

10 16 is indicative of a P*0101 CCR5 promoter allele; a 363 bp 
amplicon with experimental primer combination SEQ ID Nos, 5 & 
7, a 367 bp amplicon with experimental primer combination SEQ 
ID Nos. S&IO, a412 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with 

15 experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of a P*0102 CCR5 promoter allele; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 7, a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
9, a 367 bp amplicon with experimental primer combination SEQ 

20 ID Nos. 5 & 11, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of a P*0103 CCR5 promoter allele; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp 

25 amplicon with experimental primer combination SEQ ID Nos. 6 & 
13 and a 309 bp amplicon with experimental primer combination 
SEQ ID Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele; 
a 363 bp amplicon with experimental primer combination SEQ ID 
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Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 15 & 16 is indicative 
of a P*0202 CCR5 promoter allele; (e) determining a CCR genotype 
for each sample based upon the CCR alleles indicated following 
step (d); and (f) compiling the genotypes determined in step (e), 
thereby genotyping the representative number of individuals in the 
population, thereby surveying CCR genotypes in the population. 

The present invention is additionally directed towards a 
method of surveying HIV-1 co-receptor CCR alleles in an 
individual, comprising the steps of: (a) obtaining a biological 
sample from an individual, wherein the sample comprises genomic 
DNA; (b) combining a portion of the sample with at least one 
experimental primer combination and a control primer 
15 combination, thereby producing primer-annealed DNA, wherein 
the experimental primer combinations are selected from the group 
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 1 0, 
5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 
15 & 16, wherein the control primer combination is SEQ ID Nos. 1 7 
20 & 18; (c) amplifying the primer-annealed DNA in a reaction, 
thereby producing amplicons, wherein reaction conditions for the 
amplification are optimized for sequence-specific amplification, 
wherein each experimental primer combination and the control 
primer combination are predicted to produce one or more 
25 amplicons having expected sizes in basepairs; and (d) separating 
the amplicons by size, wherein the presence of: a 197 bp amplicon 
with the control primer combination is indicative of a CCR5 
wildtype coding sequence; a 165 bp amplicon with the control 
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primer combination is indicative of a CCR5rA32 coding sequence; a 
363 bp amplicon with experimental primer combination SEQ ID 
Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
5 experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 
16 is indicative of a P*0101 CCR5 promoter allele; a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
7, a 367 bp amplicon with experimental primer combination SEQ 

10 ID Nos. 5 & 10, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of . ?*0102 CCR5 promoter allele; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 7, a 363 bp 

15 amplicon with experimental primer combination SEQ ID Nos. 5 & 
9, a 367 bp amplicon with experimental primer combination SEQ 
ID Nos. 5 & 11, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative 

20 of a P*0103 CCR5 promoter allele; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 
13 and a 309 bp amplicon with experimental primer combination 
SEQ ID Nos. 14 & 16 is indicative of a P*0201 CCR5 promoter allele; 

25 a 363 bp amplicon with experimental primer combination SEQ ID 
Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
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experimental primer combination SEQ ID Nos. 15 & 16 is indicative 
of a P*0202 CCR5 promoter allele. 

The present invention is further directed towards a 
method of predicting the disease progression to AIDS in an HIV-1- 
5 infected individual, comprising the steps of: (a) obtaining a 
biological sample from an individual, wherein the sample 
comprises genomic DNA; (b) combining a portion of the sample 
with at least one experimental primer combination and a control 
primer combination, thereby producing primer-annealed DNA, 

10 wherein the experimental primer combinations are selected from 
the group consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 
9, 5 & 10, 5 & 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 
& 16 and 15 & 16, wherein the control primer combination is SEQ 
ID Nos. 17 & 18; (c) amplifying the primer-annealed DNA in a 

15 reaction, thereby producing amplicons, wherein reaction 
conditions for the amplification are optimized for sequence- 
specific amplification, wherein each experimental primer 
combination and the control primer combination are predicted to 
produce one or more amplicons having expected sizes in 

20 basepairs; (d) separating the amplicons by size, wherein the 
presence of: a 363 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 8, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 6 & 13 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 

25 16 is indicative of a CCR5 promoter genotype of P*0201/P*0201; a 
363 bp amplicon with experimental primer combination SEQ ID 
Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos, 5 & 10. a 412 bp amplicon with 
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experimental primer combination SEQ ID Nos. 5 & 13, a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 
8, a 412 bp amplicon with experimental primer combination SEQ 
ID Nos. 6 & 13, a 309 bp amplicon with experimental primer 
5 combination SEQ ID Nos. 14 & 16 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 15 & 16 is indicative 
of a CCR5 promoter genotype of P*0102/P*0202; a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. S & 
7, a 367 bp amplicon with experimental primer combination SEQ 

10 ID Nos. 5 & 10, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 12, a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 
13 and a 309 bp amplicon with experimental primer combination 

15 SEQ ID Nos. 14 & 16 is indicative of a CCR5 promoter genotype of 
P*0101/P*0201; a 363 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 7, a 367 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 10, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 

20 12, a 363 bp amplicon with experimental primer combination SEQ 
ID Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ED Nos, 6 & 13, a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 and a 309 
bp amplicon with experimental primer combination SEQ ID Nos. 1 5 

25 & 16 is indicative of a CCR5 promoter genotype of P*0101/P*0202; 
wherein a CCR5 promoter genotype of P*0201/P*0201 or 
P*0102/P*0202 is predictive of an accelerated rate of AIDS 
progression in the individual (relative to an individual who does 
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not possess the P*0201/P*0201 or P*0102/P*0202 gentoype). 
wherein a CCR5 promoter genotype of P*0101/P*0201 or 
P*0101/P*0202 is predictive of a slower rate of AIDS progression 
in the individual (relative to an individual who does not possess 
5 the P*0101/P*0201 or P*0101/P*0202 gentoype). 

The present invention is also directed towards a 
method of predicting the probability of HIV-l infection in an 
individual, comprising the steps of: (a) obtaining a biological 
sample from an individual, wherein the sample comprises genomic 

10 DNA; (b) combining a portion of the sample with at least one 
experimental primer combination and a control primer 
combination, thereby producing primer-annealed DNA, wherein 
the experimental primer combinations are selected from the group 
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 

15 5 & 11, 5 & 12, 5 & 13, .6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 
15 & 16, wherein the control primer combination is SEQ ID Nos. 1 7 
& 18; (c) amplifying the primer-annealed DNA in a reaction, 
thereby producing amplicons, wherein reaction conditions for the 
amplification are optimized for sequence-specific amplification. 

20 wherein each experimental primer combination and the control 
primer combination are predicted to produce one or more 
amplicons having expected sizes in basepairs; and (d) separating 
the amplicons by size, wherein the presence of: a 363 bp with 
experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp with 

25 experimental primer combination SEQ ID Nos. 6 & 13 and a 309 bp 
with experimental primer combination SEQ ID Nos. 14 & 16 is 
indicative of a CCR5 promoter genotype of P*0201/P*0201; a 363 
bp with experimental primer combination SEQ ID Nos. 5 7, a 367 
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bp with experimental primer combination SEQ ID Nos. 5 & 10, a 
412 bp with experimental primer combination SEQ ID Nos. 5 & 12 
and a 309 bp with experimental primer combination SEQ ID Nos. 
14 & 16 is indicative of a CCR5 promoter genotype of 
5 P*0101/P*0101; wherein a CCR5 promoter genotype of 
P*0201/P*0201 is predictive of a decreased probability of HIV-1 
infection in the individual (relative to an individual who does not 
possess the P*0201/P*0201 genotype), wherein a CCR5 promoter 
genotype of P*0101/P*0101 is predictive of an increased 
10 probability of HIV-1 infection in the individual (relative to an 
individual who does not possess the P*0101/P*0101 genotype). 

The present invention is yet further directed towards a 
method of correlating CCR genotypes with HIV-1 transmission 
and/or disease progression, comprising the steps of: (a) obtaining 

15 biological samples from a representative number of individuals, 
wherein each sample is from a different individual, wherein the 
sample comprises genomic DNA; (b) assessing each individual* s 
HIV-1 status and/or risk of acquiring HIV-1; (c) assigning each 
individual to a risk group, wherein the assignment is based upon 

20 the individual's HIV-1 status and/or risk of acquiring HIV-1; (d) 
combining a portion of each sample with at least one experimental 
primer combination and a control primer combination, thereby 
producing primer-annealed DNA, wherein the experimental primer 
combinations are selected from the group consisting of SEQ ID 

25 Nos, 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 
13, 6 & 7, 6 i& 8, 6 & 12, 6 & 13, 14 & 16 and 15 & 16, wherein the 
control primer combination is SEQ ID Nos. 17 & 18; (e) amplifying 
the primer-annealed DNA in a reaction, thereby producing 
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amplicons, wherein reaction conditions for the amplification are 
optimized for sequence-specific amplification, wherein each 
experimental primer combination and the control primer 
combination are predicted to produce one or more amplicons 
5 having expected sizes in basepairs; and (f) separating the 
amplicons by size, wherein the presence of: a 197 bp amplicon 
with the control primer combination is indicative of a CCRS 
wildtype coding sequence; a 165 bp amplicon with the control 
primer combination is indicative of a CCR5-A32 coding sequence; a 

10 363 bp amplicon with experimental primer combination SEQ ID 
Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. S & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 12 :tt\d a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 

15 16 is indicative of a P*0101 CCRS promoter allele; a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
7, a 367 bp amplicon with experimental primer combination SEQ 
ID Nos. 5 & 10, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with 

20 experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of a P*0102 CCR5 promoter allele; a 363 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 7, a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 5 & 
9, a 367 bp amplicon with experimental primer combination SEQ 

25 ID Nos. 5 & 11, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative 
of a P*0103 CCRS promoter allele; a 363 bp amplicon with 
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experimental primer combination SEQ ID Nos. 6 & 8, a 412 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 
13 and a 309 bp amplicon with experimental primer combination 
SEQ ID Nos. 14 & 16 is indicative of a P*020l CCR5 promoter allele; 
5 a 363 bp amplicon with experimental primer combination SEQ ID 
Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 15 & 16 is indicative 
of a P*0202 CCR5 promoter allele; (g) determining a CCR genotype 

10 for each sample based upon the CCR alleles indicated following 
step (f); and (h) analyzing the genotypes determined in step (g), 
thereby genotyping the representative number of individuals with 
respect to the risk group assigned each individual, thereby 
correlating CCR genotypes with HIV-1 transmission and/or disease 

15 progression. Representative biological samples include blood, 
serum, saliva, semen, tissue biopsy and isolated DNA. Generally, 
separation is by gel electrophoresis, but may also be by means 
such as size fractionation. A preferred method of amplification 
includes polymerase chain reaction (PCR). Typical means of 
20 analysis include two-tailed Fisher's exact test, multiple logistic 
regression analysis, univariate analysis and multivariate analyses. 

The present invention is additionally directed towards 
the oligonucleotides shown in SEQ ID Nos. 2-16. 

In accordance with the present invention, there may be 
25 employed conventional molecular biology, microbiology, and 
recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e.g.. 
Sambrook, Fritsch & Maniatis, "Molecular Cloning: A Laboratory 
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Manual (2nd Ed.)", (1989); "DNA Cloning: A Practical Approach," 
Volumes I and 11 (D.N. Glover ed. 1985); "Oligonucleotide 
Synthesis" (MJ. Gait ed. 1984); "Nucleic Acid Hybridization" [B.D. 
Hames & S.J. Higgins eds. (1985)]; "Transcription and Translation" 
5 [B.D. Hames & S.J. Higgins eds. (1984)]; "Animal Cell Culture" [R.I. 
Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, 
(1986)1; B. Perbal, "A Practical Guide To Molecular Cloning" 
(1984). Therefore, if appearing herein, the following terms shall 
have the definitions set out below. 

10 A "DNA molecule" :cfers to the polymeric form of 

deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in 
its either single stranded form, or a double-stranded helix. This 
term refers only to the primary and secondary structure of the 
molecule, and does not limit it to any particular tertiary forms. 

15 Thus, this term includes double-stranded DNA found, inter alia, in 
linear DNA molecules (e.g., restriction fragments), viruses, 
plasmids, and chromosomes. In discussing the structure herein, 
the normal convention of giving only the sequence in the 5* to 3' 
direction along the nontranscribed strand of DNA is used (i.e., the 

20 strand having a sequence homologous to the mRNA). 

A DNA "coding sequence" is a double-stranded DNA 
sequence which is transcribed and translated into a polypeptide in 
vivo when placed under the control of appropriate regulatory 
sequences. The boundaries of the coding sequence are determined 

25 by a start codon at the 5' (amino) terminus and a translation stop 
codon at the 3* (carboxyl) terminus. A coding sequence can 
include, but is not limited to, prokaryotic sequences, cDNA from 
eukaryotic mRNA, genomic DNA sequences from eukaryotic (e.g.. 
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mammalian) DNA, and even synthetic DNA sequences. A 
polyadenylation signal and transcription termination sequence will 
usually be located 3' to the coding sequence. Transcriptional and 
translational control sequences are DNA regulatory sequences, 
5 such as promoters, enhancers, polyadenylation signals, 
terminators, and the like, that provide for the expression of a 
coding sequence in a host cell. A "cDNA" is defined as copy-DNA 
or complementary-DNA, and is a product of a reverse 
transcription reaction from an mRNA transcript. 

10 A "promoter sequence" is a DNA regulatory region 

capable of binding RNA polymerase in a cell and initiating 
transcription of a downstream (3* direction) coding sequence. 
Within the promoter sequer>.> will be found a transcription 
initiation site, as well as protein binding domains (consensus 

15 sequences) responsible for the binding of RNA polymerase. 

As used herein, the terms "restriction endonucleases" 
and "restriction enzymes" refer to enzymes which cut double- 
stranded DNA at or near a specific nucleotide sequence. 

The term "oligonucleotide" is defined as a molecule 

20 comprised of two or more deoxyribonucleotides, preferably more 
than three. Its exact size will depend upon many factors which, in 
turn, depend upon the ultimate function and use of the 
oligonucleotide. The term "primer" as used herein refers to an 
oligonucleotide, whether occurring naturally as in a purified 

25 restriction digest or produced synthetically, which is capable of 
acting as a point of initiation of synthesis when placed under 
conditions in which synthesis of a primer extension product, which 
is complementary to a nucleic acid strand, is induced, ue., in the 
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presence of nucleotides and an inducing agent such as a DNA 
polymerase and at a suitable temperature and pH. The primer may 
be either single-stranded or double-stranded and must be 
sufficiently long to prime the synthesis of the desired extension 
5 product in the presence of the inducing agent. The exact length of 
the primer will depend upon many factors, including temperature, 
source of primer and the method used. For example, for 
diagnostic applications, depending on the complexity of the target 
sequence, the oligonucleotide primer typically contains 15-25 or 

10 more nucleotides to maintain specificity, although it may contain 
fewer nucleotides. 

Primers are selected to be "substantially" 
complementary to different strands of a particular target DNA 
sequence. This means that the primers must be sufficiently 

15 complementary to hybridize with their respective strands. 
Therefore, the primer sequence need not reflect the exact 
sequence of the template. For example, a non-complementary 
nucleotide fragment may be attached to the 5* end of the primer, 
with the remainder of the primer sequence being complementary 

20 to the strand. Alternatively, non-complementary bases or longer 
sequences can be interspersed into the primer, provided that the 
primer sequence has sufficient complementarity with the sequence 
or hybridize therewith and thereby form the template for the 
synthesis of the extension product. 

25 As used herein, the term "allele" is defined as the 

combination of polymorphic nucleotides at a given site on a 
chromosome. 
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As used herein, the term "genotype" is defined as the 
combination of two alleles in a given individual. 

As used herein, the term "sequence-specific 
amplification" refers to PGR using sequence-specific primers that 
only produce amplicon(s) when the target sequence is present. 

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not 
meant to limit the present invention in any fashion: 

KYAMPT.FI 

Human suhjects 

Subjects from 2 established cohorts were studied; one 
in Kigali, Rwanda, and the other in New York City. 
Sociodemographic characteristics and epidemiological findings 
have been described elsewhere for these cohorts (20-23). The 
Kigali cohort (20,21) included 202 seropositive women with 
adequate follow-up since 1986. One of these subjects did not yield 
complete genotyping data and was excluded from further analysis. 
The cohort additionally included 77 seronegative women. 
Categorization of subjects were based on their clinical outcomes in 
combination with laboratory findings as defined by the Kigali 
staging system (21). Three subgroups of Kigali's were established: 
15 rapid progressors (RPs) who died from AIDS in <6 years; 100 
slow progressors (SPs) who remained asymptomatic for >10 years, 
and subjects that could not be classified definitively based on 
disease progression were treated as indeterminants (n=53) and 
combined with intermediate progressors (n=33) who had less 
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extreme clinical outcome in the period of 6-10 years after 
seroconversion. 

The clinical and epidemiologic categorization of the 
subset of heterosexual HIV- 1 -positive Rwandan women available 
5 for genetic studies have been summarized. For the 7 7 
seronegative and the 201 seropositive women, the following risk 
factors were analyzed: 1) currently living alone but having 
multiple sexual partners or currently living with a single partner 
but both partners having extramarital affairs; 2) history of sexually 
10 transmitted disease in the past five years; 3) having multiple 
lifetime sexual partners; and 4) no condom use during sex in the 
last 2 years. 

The Bronx, New York cohort consisted of subsets of 
injecting drug users recruited from three ethnic groups, including 

15 58 Caucasians, 135 Hispanics, and 35 African- Americans (22,23). 
From the New York cohort of injecting drug users, 141 individuals 
were selected who remained seronegative despite high risk (based 
on a combination of the following non-exclusive risk behaviors) 
and 87 who became infected despite relatively little such risk or 

20 exposure: 1) having sex with another injecting drug user; 2) 
sharing needles in shooting galleries; 3) attending shooting 
galleries; and 4) sharing needles with strangers. Men having sex 
with other men was another risk factor observed in the overall 
New York cohort, but not in the subset of subjects studied here. 

25 Studies in both cohorts conformed to the procedures for informed 
consent approved by local and/or sponsoring institutional review 
boards. 
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FYAMPf F l 

Primatfi species 

Non-human primates, including 2 chimpanzees {Pan 
troglodytes)^ 2 pig-tailed macaques (Macaco nemestrina), and 2 
5 sooty mangabeys (Cercocebus atys) were studied for their CCR5 
promoter polymorphisms. Their CCR5 promoter sequences were 
amplified by PC!R and analyzed by automated sequencing as for 
human samples. 

10 EXAMPLE 3 

fifinnryping matpnals 

DNA samples used throughout the study were extracted 
from whole buffy coats or precipitates of cervicovaginal fluids, 
using the standard salting out procedures (24,25) and QIAamp 
15 blood kit (QIAGEN Inc., Chatsworth, Calif., USA), respectively. 

EXAMPLE 4 

Genotyping procedures 

Genotyping of CC/?5- A32 by PGR amplification size 

20 polymorphism and CCRlb-Ml by PCR-RFLP was performed (1,4). 
Typing of CCR5 promoter variants was initially achieved through 
automated sequencing of PCR-amplified products. The CCR5-^32 
and CCR2b-641 are approximately 2 kb downstream and 12.7 kb 
upstream, respectively, from the CCR5 downstream promoter that 

25 spans nucleotide 59052G (in the extended loci as defined by 
GenBank sequence U59626) to 59530C (13). 

Typing of CCR5 promoter variants was initially achieved 
through automated sequencing of PCR-amplified products 
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corresponding to nucleotide position 5901 2G to 59943G. Briefly, 
the design of allele-specific PCR amplification primers took 
advantage of the dimorphic site at nucleotide position 59029 
(relative to Genbank accession number U95626), 20 bp upstream 
5 from the first nucleotide of the minimum CCR5 promoter segment 
(14). Two separate PCR reactions were performed for each 
sample: the first using the 59029G-specific forward primer 
(nucleotides 59012G^59029G) and common reverse primer 
(CCR5P-COM3N, 59925G«-59943G), the second using the 59029A- 
10 specific forward primer (59012G^59029A) and primer CCR5P- 
COM3N. 

PGR conditions were optimized to allow sequence- 
specific amplification determined by the 59029G/A-specific 
primers: each 12.5 jil PGR consisted of IX buffer C (60 mM Tris- 

15 HCl (pH 8.5), 15 mM (NH4)2S04. 2.5 mM MgClj), 0.3 units of 
AropliTaq polymerase, together with 0.5 \iM of each primer, 80 ng 
genomic DNA, 0.2 mM each of dGTP, dCTP, dTTP and dATP. The 
PGR mix was subjected to 10 cycles of denaturing at 95°C for 25 
sec, annealing at 59''C for 40 sec, and extension at 72''C for 50 sec, 

20 followed by 23 cycles of denaturing at 95''C for 25 sec, annealing 
at 55''C for 40 sec, and extension at 72''C for 50 sec. Samples 
yielding a 932-bp amplicon were diluted 1:40 in TE buffer (10 mM 
Tris-HCl (pH 8.0), 2 mM EDTA) and 1.0 ^1 of the diluted product 
served as template for a 50-^1 second-round PCR with a 

25 biotinylated reverse primer (CCR5P-C0MR, 59709T<-59729T) and 
a forward Ml 3 (18 mer)-tailed primer (M13-CCR5P, 
59012G-^59028G). 
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After a further 22 cycles of denaturing at 95'*C for 25 
sec, annealing at 59»C for 40 sec, and extension at 72*C for 50 sec, 
the resulting products were bound to streptavidin-coated combs 
and single-stranded templates were generated and sequenced using 
5 a Cy5-labelled universal M13 primer (5'-ACA GGA AAC AGC TAT 
GAC-3'; SEQ ID No. 1), a Cy5-labelled internal primer (CCR5PSBT, 
59291T-»5931DA), and the ALFexpress autoload sequencing 
procedures (Pharmacia Biotech, Inc., Piscataway, NJ, USA). The 
allelic sequencing strategy outlined above resolved all known 

10 polymorphic sites in the constitutive downstream CC/?5 promoter 
region without cloning and its attendant problems due to PCR- 
introduced nucleotide substitution (26). The reliability of data 
generated by automated sequencing was further confirmed by 
sequencing the complementary strand. 

15 To expedite the genotyping procedures, 15 pairs of 

sequence-specific primers (SSP) were used to type the 5 major 
CCR5 promoter alleles simultaneously with the CCR5 and CCR2b 
variants (Tables 1 &. 2). The final conditions for SSP-based typing 
were as follows: 1) Each 10 \i\ PCR solution contained IX buffer C, 

20 50-70 ng of genomic DNA, 0.3 units of AmpIiTaq polymerase 
(Fisher Scientific, Norcross, GA, USA), 120 nM of each control 
primer, 250 nM each of specific primer, 0.4 mM each of dOTP, 
dCTP, dTTP and dATP, 10% (v/v) glycerol, and 0.02% cresol red; 2) 
PCR cycle reactions were performed on an UNO-thermoblock 

25 (Biometra, Inc., Tampa, FL, USA) for 10 higher-stringency cycles of 
denaturing at 94»C for 25 sec, annealing at eO^C for 45 sec, and 
extension at 72''C for 45 sec, followed by 21 additional lower- 
stringency cycles of denaturing at 94°C for 25 sec, annealing at 
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ST^C for 40 sec, and extension at 72°C for 40 sec; 3) all PCR 
reactions were done in 96-welI microliter plates, each capable of 
typing 6 individual samples; 4) before each cycle reaction, PCR 
began with a denaturing step at PS-'C for 2.5 min; 5) all PCR 
5 reactions included a final extension step at 72°C for 6 min; 6) half 
of each PCR reaction product was loaded directly onto a 1.7% 
agarose gel for electrophoresis. The SSP-banding patterns were 
recorded on photographs of ethidium bromide-stained gels. 

The use of PCR-SSP was limited to the identification of 

10 known genetic variants; novel alleles that may exist in different 
populations could either give rise to novel reaction patterns or be 
misclassified by the scheme used herein. The 67 samples from 
Kigali and New York typed by both sequencing and PCR-SSP 
demonstrated 100% consistency between the 2 methods (Table 3). 

15 Thus, new alleles were not encountered even when the number of 
genotyped samples increased. 
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TABf.F. 1 



10 



15 



20 



25 



QliPO nrimers llSfd for tvpinp major pnlymnrphi sms in PrPOh 
CCR5. and the CCUfi downstrpflm pr omorp.r rpgion 



5 QligO namfr Specificity" 



5'— »3' sequence 



Annealing SEQ 





riindftrlinft=no1vmornhir:^ 


position 


ID No 


/^/^"D^U Cite 


L^CKZO-Ot V 


tgggcaacatgctggtcg 


46278-^6295 


2 


CCR2bo/2S 




tgggcaacatgctggtca 


46278->46295 
61619^61636 


3 




general 


tggaaaataagggccacagac 


46670<-46690 


4 


CCR5P-5/1S 


59029G 


gagtggagaaaaaggggg 


59012-»59029 


5 


CCR5P-5/2S 


59Q29A 


gagtggagaaaaagggga 


59012-^59029 


6 


CCR5P-3/1S 


59353T;59356C 


agaatagatctctggtctgaaa 


59353<-59374 


7 


CCR5P-3/2S 


59353C 


agaatagatctctggtctgaag 


59353<-59374 


8 


CCR5P-3/3S 


59353T;59356T 


gagaatagatctctggtctaaaa 


59353<-59375 


9 


CCR5P-3/4S 


59356C 


tagagaatagatctctggtctg 


59356<-59377 


10 


CCR5P-3/5S 


59356T 


tagagaatagatctctggtcta 


59356<-59377 


11 


CCR5P-3/6S 


59402G 


agaatcagagaacagttcttcc 


59402<-59423 


12 


CCR5P-3/7S 


59402A 


agaatcagagaacagttcttct 


59402<- 5<^423 


13 


CCR5P.5/3S 


59653C 


caggaaacccatagaagac 


59635- >Jl>653 


14 


CCR5P-5/4S 


59653T 


caggaaacccatagaaga! 


59635-v5f?633 


15 


CCR5P-3/8G 


general 


gtgggcacatattcagaag 


59925<-59943 


16 


CCR5-SP4G 


general 


tcattacacctgcagctctc 


62004-»62023 


17 


CCR5.PM6G 


general 


tggtgaagataagcctcac 


62182^-62200 


18 



♦The numbering of nucleotide positions is based upon GenBank 
sequence U95626. 
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TARf K 1 

HrimeE mixes used in — PrR-SSP.ha. ftd typing of m^jnr 

polvmornhisms in the rCR?.h. rrRS and thP rr R s pmmmpr 



Primpir mivpg 

5 

1 a=CCR2-5/l S+CCR2-3/1 G 
lb=CCR2-5/2S+CCR2-3/lG 
bp 

2a=CCR5P-5/l S+CCR5P-3/1S 
1 0 2b=CCR5P-5/l S+CCR5P-3/2S 
2c=CCR5P-5/2S+CCR5P-3/l S 
2d=CCR5P-5/2S+CCR5P-3/2S 
2e=CCR5P-5/lS+CCR5P-3/3S 

1 5 3a=CCR5P-5/l S+CCR5P-3/6S 
3b=CCR5P-5/l S+CCR5P-3/7S 
3c=CCR5P-5/2S+CCR5P-3/6S 
3d=CCR5P-5/2S+CCR5P-3/7S 
4a=CCR5P-5/3S+CCR5P-3/8G 

20 4b=CCR5P-5/4S+CCR5P-3/8G 
5a=CCR5P-5/lS+CCR5P-3/4S 
5b=CCR5P-5/l S+CCR5P-3/5S 
Control= 

CCR5-SP4G+CCR5-PM6G 

25 



Specifirity* 


Size of PGR 




.amplicon 


CCR2b-64Val 


413 


bp 


CCR2b-64Ile; CCR5 


413; 585 


59029G-59353T 


JO J 


bp 


59029G-59353C 




hn 
Dp 


jyUZVA-jyjOi 1 3o3 


bp 


59029A-59353C 


363 


bp 


59029G-59353T- 


364 


bp 


59356T 




59029G-59402G 


412 


bp 


59029G.59402A412 


bp 


59029A-59402G412 


bp 




59029A-59402A412 


bp 




59653C 


309 


bp 


59653T 


309 


bp 


59029G-59356C 


367 


bp 


59029G-59356T 


367 


bp 


CCR5 






A3 2 


165 


bp 


WT 


197 


bp 



* The numbering of nucleotide positions is based upon GenBank 
30 sequence U95626. 
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TAW F. ^ 



10 



15 



20 



E xtended CCR5 downstream — promntftr fliiH^'g nht ainpH hy aii^ii p 

Sfint l ftnoinP of PCR prorlnrK HpHvpH frnm fi 7 Sftlftrtively gf>q..^nr-AH 

samplfts 



Allele 

designations 



59029 
sequence 



Combinations of 
polymorphic sites'* 



59653 
sequence 



SEQID 
No. 



CCR5P*0101 15 

CCR5P*0102 19 

CCR5P*0103 21 

CCR5P*0104 1 

CCR5P*0201 22 

CCR5P*02O2 56 



G 


T 

^59353 


■^59356' 


^59402 


'^59648 


(=P4) 


C 


24 


G 


^^59353' 


'^59356' 


•'^59402 


■^59648 


(=P2) 


C 


25 


G 


^59353" 


''^59356' 


^59402* 


^59648 


(=P3) 


c 


26 


G 


^59353" 


'^59356- 


^59402" 


^59648 


(=P2) 


c 


27 


A 


^59353" 


'^59356' 


'^59402' 


'^59648 


(=P1) 


c 


28 


A 


^59353' 


'^59356' 


'^59402*^59648 


(=P1) 


T 


29 



'Allelic sequences P*0101, P*0102, P*0103, P*0201 and P*0202 
have been deposited in GenBank with Accession numbers 
AF109379, AF109380, AF109381, AF109382. AF109383. 
respectively. Allele P*0104 was only observed once. "The 
numbering of nucleotide positions is based on GenBank sequence 
U95626. Polymorphism at position 59029 corresponds to the 
classification of an earlier study (5). 'This region has also been 
studied (6) and the alternative allele nomenclature is shown in 
parentheses. 



25 



RYAMPf F! g 

Tjnkage analvsis of PRnfttir variants at thp rrR'> h rrps ^ and rrp*: 

prnmnrer Inri 

30 Linkage between allelic variants at different loci are 

typically estimated with data from . families (27), which were not 
available in the cohorts studied herein. Instead, "apparent" 2- 
locus linkage was determined by tests for a 2 x 2 table 
containing the numbers of individuals with both, one without the 

35 other, or neither of the 2 markers in question (28). A p value 
documenting a significant association of two alleles (i.e., 
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suggesting linkage disequilibrium) did not eliminate the possibility 
of their coincident occurrence on opposite chromosomes. 
Conversely, fully reliable two-locus haplotypes could be 
established only when alleles at one of the two loci were 
5 homozygous, and only such haplotypes were accepted for analysis. 

Phylogenetic analyses of CCRS prnmnfft r sftqiiftnni*R 

The phylogenetic relationships among allelic sequences 

10 from human and non-human primates species were analyzed using 
software in the PHYUP (29) and PAUP (30) packages. All CCR5 
promoter sequences were aligned manually, with gaps introduced 
to increase alignment of sequences from primates. The final data 
set contained sequences spanning from nucleotide position 59029 

15 (in the extended loci as defined by GenBank sequence U59626) to 
59940. 

EXAMPLE? 

Gftnntypinp of rrR2h>64I hy PTR^RRP anH PTR wUh spqiiftnr<>> 
20 gpftfiifir primftfR (.^SP) 

The CCR2b-64 polymorphisms were determined by PCR- 
RRP based on an amplification-created restriction site in the 129- 
bp fragment (4) corresponding to nucleotide positions 46275 to 
46403 (relitive lo U95626). The CC/?2Z?-64I-bearing sequence 
25 could be digested by BsaBl into two fragments of 18 bp and 1 1 1 
bp, while the sequence carrying the CCR2b'64W was refractory to 
digestion. To differentiate the CCR2b-64W from CC/?2*-64I, PCR- 
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SSPwas used in two separate PGR reactions with both control and 
sequence-specific primer sets. 



RVAMPIFk 

5 GRnotvninp of rCR5-A32 hv PCR amplificarinn si7f> pnlymnrphi<^m 

PGR amplification of the CCR5 coding sequences 
corresponding to nucleotide positions 62004-62183 (relative to 
U95626) differentiated the CCi?5-A32 and wild type alleles (1), 
producing a 180-bp amplicon for the wild type sequence and a 
10 148-bp product for the deletion sequence. Additional primers 
were also used to amplify the deletion region for sequencing and 
size differentiation in a PCR-SSP format. 



EXAMPf.K9 

15 Simultaneous penotyping of TCR^h, rrRS^ and CCR^ prnmnfpr 
polymnrphisniR h^sftd nn PCR-SSP 

To facilitate simultaneous PCR-SSP-based typing of the 
dimorphisms CC/?2fe-64V/I, CCR5 wild type/CC/?5-A32, and CCR5 
promoter alleles P*0101, P*0I02, P*0103, P*0201 and P*0202, a 

20 panel of 21 individual primers were selected based upon a) similar 
melting temperatures; b) no mispriming; c) no strong secondary 
structure; d) no homodimer/heterodimer formation. The first 1 2 
primer mixes were adequate for the routine genotyping of each 
sample. Three additional primer mixes were required to confirm 

25 samples carrying CCR5P*0103, which gave weak positive signals in 
reaction 2a due to the presence of 59356T. Samples whose CCR5 
promoter variants were genotyped by sequencing and whose 
CCR2b dimorphism was genotyped by PCR-RFLP (4) served as 
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reference materials for testing the reliability of the PCR-SSP assay. 
Different thermostable DNA polymerases including Amplira^ 
(Promega, Madison, WI, USA) and Platinum Tag (Life Technologies. 
Gaithersburg, MD, USA) yielded comparable data sets, showing 
100% consistency between PCR-SSP and sequencing and between 
PCR-SSP and RFLP. 

Using the PCR-SSP format, the CC/?5 genotype could be 
assigned according to the size of the SSP control product amplified 
by primers flanking the 32-bp deleted region. Confirmatory 
product was also available in SSP reaction lb: primer CCR2b-5/2S 
(also CC/?2i»-64I-specific) and primer CCR5-PM6G (CCi?5-general) 
in this multiplex reaction amplify a larger fragment spanning the 
32-bp deletion region in CCR5. As a result, the undeleted wild type 
CCR5 sequence gave rise to a 197 bp product in all 12 reactions as 
well as a 585 bp product in reaction lb, while the CCJ?5-A32 
genotype produced 165 bp and 553 bp amplicons in the 
corresponding reactions. The CC/?5-specific product from reaction 
lb was also tested by RFLP for the presence of the CC^5-m303 
polymorphism (45), which were uncommon or absent in other 
populations (1.43,48) as well as in the New York and Rwandan 
subjects tested here. 



25 



KYAMPT F 1ft 

Statistical analygis 

Kigali and New York subjects were analyzed separately; 
New York subjects belonged to three , ethnic groups and, as such, 
were analyzed separately and jointiy. Standard procedures in 
Epilnfo software (33) were used to calculate and odds ratio 
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calculations with 95% confidence intervals using differential 
distribution of genetic factors in subjects with varying rates of 
HIV-1 disease progression, to measure apparent linkage 
disequilibrium among CCR variants, and to evaluate associations of 
5 genetic and non-genetic risk factors with HIV-1 infection. Multiple 
logistic regression analysis by SAS (SAS Institute, Inc., Cary. NC) 
was performed by incorporating 1) all CCR and SDFl variants; 2) 
ethnicity and age of participants; and 3) individual risk factors 
with significant associations with HIV-1 infection (Kigali) or 
10 individual risk factors used for selecting participants (New York). 
Multivariate analyses were also performed using a summary risk 
score, but this reduced model was less informative. Yates' 
corrected method or a two-tailed Fisher's exact test was applied to 
analyses with small numbers of observations. 

15 

EXAMPLE 11 

Tdf.nrification — anH fregenry of em's prnmoter alleleR anH nrliF>r 

relatftd markers 

Analyses yielded 6 different alleles and 15 genotypes in 

20 the cohort of Rwandan women, 9 of which were seen at a 
frequency higher than 5% (Table 4). Direct allele-level sequencing 
of the CCR5 promoter region spanning nucleotide 59029 to 59653 
of PCR-amplified human genomic DNA revealed 6 CCR5 promoter 
alleles: five were common and one was observed in only a single 

25 individual (Table 3). The 5 common alleles (not including the rare 
polymorphism at position 59648) could theoretically produce 1 5 
genotypes (allele combinations), 14 of which were observed in the 
60+ samples examined herein. The 5 major allelic variants were 
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defined by dimorphic sites at nucleotide positions 59029, 59353, 
59356, 59402, and 59653 relative to GenBank sequence U59626. 
By this definition, allele sequences P*0101 and P*0201 matched 
GenBank sequences U95626 and AF031237, respectively. 
5 The distributions of CCR2b, CCR5, and CCR5 promoter 

variants differed widely among distinct ethnic groups (Figure 3). 
Overall, 6 major ( •major' is defined herein as having a frequency 
higher than 5%) CCR5 promoter genotypes were detected in 
Hispanics, 7 in Caucasian- Americans, 9 in African- Americans, and 

10 10 in Rwandans (Figure 4). Several genotypes were apparently 
restricted to certain ethnic groups, but the small number of 
subjects in the African-American and Caucasian-American groups 
did not allovi reliable comparisons among populations surveyed 
here. Hispanic and non-Hispanic Caucasians resemble each other 

15 rather closely, and black African- Americans in New York and 
Rwandans also resemble each other in the distributions of 
promoter genotypic frequencies (Figure 4). In particular, allele 
P*0103 was largely restricted to populations of African origin. As 
a result, there were more CCR5 promoter genotypes (allele 

20 combinations) in Africans than Caucasians and Hispanics. Multiple 
linkages among CCR2b, CCR5, and CCR5 promoter variants were 
consistent in all populations (Tables 5 & 6). For example, 
exclusive linkage of promoter allele P*0202 (59653T) to CCR2b-64l 
was consistent with an earlier observation in Caucasians (2). 

25 Meanwhile, promoter allele P*0201 was the only variant that was 
linked to CCR5 -A32. Mutually exclusive linkages with P*0201 and 
P*0202 implied that CCR2b-641 and CCR5-A32 could not exist on 
the same chromosome, and the negative linkage between CCR2b- 
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641 and CCR5'A32, as revealed by the analysis herein, confirmed 
this association (or lack, thereoO (Table 5). 

Sequences from non-human primates revealed 
differences in 2 of 6 putative transcription factor binding elements 
5 within the CCR5 promoter (Figure 5A). As would be predicted, 
pair-wise genetic distance (Figure 5B) and phylogenetic analyses 
(Figure 5C) placed the chimpanzee CCR5 promoter sequence closer 
to all human CCR5 promoter alleles than to alleles from other 
primate species. The ancestral lineage of human CCR5 promoter 

10 alleles appeared to be P*^102, which shared identical sequences 
with other non-human alleles at nucleotide positions that define all 
major promoter alleles in humans (i.e., 59029, 59353, 59356, 
59402, and 59653). Moreover, P*0201 and P*0202 apparently 
diverged from the common ancestral sequence more recently than 

15 did other alleles. By the same analyses, polymorphisms at two 
positions, the G at 59029 and the T at 59353, appeared ancestral 
to any other lineages. 

Finally, in all human and non-human alleles, exclusive 
linkage between the G-to-A nucleotide substitution at position 

20 59029 to the T-to-C change at position 59356 presumably 
indicates either gene conversion or simultaneous point mutations. 
If the former is true, other upstream (CCR2b coding region) or 
downstream (CC/?5 coding region) polymorphisms are likely to be 
linked in the same fashion. The CC/?26-64I-59653T linkage served 

25 as one example, and CC/?5-A32-P*0201 linkage served as another. 
Thus, a disease association with an individual polymorphic site can 
actually signal relationships with one or more additional 
polymorphisms at multiple inter-linked positions. 
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TABLF. 4 

CCR2h nn(\ CCR*} mnmnter pftnotypfis nhspirvi^ rf in mv.t-infffrrf>fi 
Rwandan wnmp-n 

5 PrOmntPr genntypftS KxpftrtpH frpqn.»nry OhsPrvpH frftqiiPnry <;%^ 



P*0101/*0101 1.2 3.0 

P*0101/*0102 7.9 6.0 

P*0101/*0103 1.8 0.0 

P*0101/*0201 4.2 4.5 

10 P*0101/*0202 7.9 6.0 

P*0102/*0102 6.8 12.4 

P*0102/*0103 4.3 0 

P*0102/*0201 4.9 6.0 

P*0102/*0202 18.4 15.4 

15 P*0103/*0103 0.7 2.5 

P*0103/*0201 3.1 2.5 

P*0103/*0202 5.8 9.0 

P*0104/*0202 0.2 0.5 

P*0201/*0201 3.6 5.5 

20 P*02Gl/*0202 13.3 13.9 

P*0202/*0202 12.5 12^9 
CCR2b genotypes 

64V/I 40.8 42.3 

64I/I 8.2 7.5 
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TABLE 5 

Linkagfi among — CCR2h, CCR5, and rr/?5 prnmnrpr v aHanfg in 

siihjecr5s from the Bronx, New York^ 



Variant 1 Variant 2 


N " 


1+/2+ 


1+/2- 


1-/2+ 


1-/2- 


CK 


95% CI 


P 


CCR2b-641 CC/?J-A32 


193 


1 


53 


11 


128 


0.22 


0.03-1.74 


0.118 


CCR2b'64l CCR5 promoter 
















P*0101 


228 


20 


42 


109 


57 


0.25 


0.13-0.46 


0.001 


P*0102 


228 


10 


52 


46 


120 


0.50 


0.24-1.07 


0.071 


P*0201 


228 


23 


39 


99 


67 


0.40 


0.22-0.73 


0.002 


P*0202 ' 


228 


62 


0 


7 


159 


23.7 


11.5-49.0 


0.001 


59029G " 


228 


37 


25 


138 


28 


0.3 


0.16-0.58 


0.001 


CCR5-L32 P*0201 


193 


12 


0 


97 


84 


1.87 


1.63-2.14 


0.002 



^Subjects from Bronx included Caucasians (n=58), Hispanics 
(n=135), and African-Americans (n=35). •'African-Americans 
(n=35) are not included in the analysis involving CC/?5- A32, as this 
variant is not present in this ethnic group. *=Promoter allele P*0202 
exclusively carries 59653T (Table 3). **50029G is exclusively 



10 linked to 59353T, and both occur at equal frequency in this cohort 
regardless of the ethnic group. 
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TARI K It 

LinkaPft analysis of CCRlh-MJ and inrtiviHnal rrR<; pr^mnfPr all^lAc 
a nd nolvmornhic sifP.S in a rnhorl nf ? ? « Rwandan wnmPti* 



Variant 1 


Variant 2 


1+/2+ 


I+/2. 


1-/2+ 


1-/2- 


OR 


95% CI 


P 


CCR2b-64l 


Promoter 


variants 
















P*0101 


15 


115 


38 


110 


0.4 


0.20-0.73 


0.003 




P*0102 


33 


97 


87 


61 


0.2 


0.14-0.40 


0.001 




P»0201 


33 


97 


73 


75 


0.4 


0.21-0.58 


0.001 




P«0202 


129 


1 


19 


129 


875.8 


115.5-6639.7 


0.001 




59029G * 


67 


63 


126 


22 


0.2 


0.10-0.33 


0.001 



*C/?5-A32 is absent in this cohort. >»50029G is exclusively linked to 
59353T, and both occur at equal frequency in this cohort. 



KXAMPf F 11 

CCRS promnfftf fftnotypfts anH HTV-I infertmn 

10 Seropositive and seronegative individuals consistently 

differed in their distributions of two CCR5 promoter genotypes: 
the homozygous genotype P*0101/*0101 was more frequent in 
seropositives than in seronegatives (Table 7), while homozygous 
or heterozygous genotypes involving allele P*0201 were more 

15 frequent in seronegatives than in seropositives (Table 7). Despite 
the tight linkages that existed among alleles of CCR2b, the CCR5 
coding region, and the CCR5 promoter, »,he relationships between 
CCR5 promoter alleles and occurrence of HIV-1 infection could not 
be attributed to either CCR2b or variants in the CCR5 coding region 

20 (Table 8). Homozygosity and heterozygosity for SDFl-yA, a 
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polymorphism in the natural ligand of CXCR4 (54), were further 
excluded as a contributing factor. 

Several CCR5 promoter genotypes demonstrated trend 
for association in either combined groups, probably as a result of 
ethnically specific effects. For example, the homozygous promoter 
form P*0103/*0103 was found in 6 (or 2.8%) HIV-l-infected black 
individuals but was absent from seronegatives (OR = 5.8, p = 
0.097). This relationship was as strong as that recognized earlier 
for the P*0101/*0101 genotype. 

The promoter ?nele P*0103 was largely restricted to 
blacks. Thus, the protective genotypes involving P*0201 could be 
split into 5 major genotypes (P*0101/*0201, P*0102/*0201, 
P*0103/*0201, P*0201/*0201, and P*0201/*0202) in black 
populations, but genotype P*0101/*0103 was not detected. In the 
15 Caucasoid group, P*0201 can form 4 genotypes, and one of these 
(P*0 1027*0201) was rare. The re-analyses revealed that only 
genotypes P*0201/*0201 and P*0201/*0202 were associated with 
protection against infection in Cauicasoids (OR = 0.5, p = 0.032), 
compared with P*0102/*0201, P*0103/*0201, and P*0201/*0201 
20 in blacks (OR = 0.3, p = 0.001). Another P*0201+ genotype 
P*0101/*0201 was equally distributed in seropositive and 
seronegative groups regardless of their ethnicity. Therefore, the 
relative contribution to protection by different genotypes 
involving allele P*0201 varied according to the ethnic background. 
25 The relationships observed on CCR5 promoter 

genotypes remained the same when two additional non-CCR factors 
known to influence the rates of HIV-l disease progression in this 
cohort were considered. Specifically, homozygosity at HLA class I 
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A or B or both was strongly associated with more rapid disease 
progression in these Rwandan women, while HLA B*57 was 
associated with slower disease progression. Data from 
combinations of promoter and HLA factors strongly suggest that 
5 the protective effects of promoter genotypes P*0101/*0201 and 
P*0101/*0202 are independent of the HLA factors, as were the risk 
effects of P*0201/*0201 and P*0102/*0201 (Table 10). 

The characterization of CCR5 promoter allelic 
sequences provided simple explanations for previous associations 

10 with disease progression (Figure 6) and allowed further 
predictions. First, alleles P*0201 and P*0202 were the only 
promoter variants carrying the previously recognized disease- 
accelerating Pl/Pl genotype (6). These two alleles carried 
exclusively 59029A and not the disease-delaying 59029G/G (5). 

15 As a result, Pl/Pl homozygotes and 59029G/G-carriers 
represented 2 mutually exclusive groups of promoter genotypes. 
The association of the PI /PI -carrying P*0201/*0201 with more 
rapid disease progression as revealed by our data supported and 
refined the known effects of Pl/Pl. 

20 Second, as noted above, P*0202 (59653T) is exclusively 

linked to CCR2b'64l (Tables 5 & 6), which has been associated 
with slower disease progression in cohorts of seroconverters 
(2,4,11,12). Thus, the relationship between Pl/Pl genotype and 
rapid HIV-1 disease progression must depend upon P*0201 and not 

25 the CC/?2i)-64Minked P*0202 (Figure 6B). The lack of association 
between CCR2b-641 and slower disease progression in Rwandan 
women was seemingly inconsistent with findings from another 
African cohort (34). However, exclusive linkage of 59653T to 
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CCR2b-64I (2), which in turn solely defines promoter allele 
P*0202, made these findings easily explainable. In other earlier 
studies, CCR2b'64l variant has been less clearly associated with 
slower disease progression in certain seroprevalent groups than in 
5 cohorts of seroconverters (2,4,10-12), perhaps as much because 
of their genetic heterogeneity as because of misclassification with 
respect to outcome. The contrasting relationships seen here with 
the CC/?2fc-64I-carrying P*0 10 1/*0202 (protective) and 
P*0201/*0202 (risk) indicated clearly that the CCR2b-64l effect 

10 can go either way dependending upon the promoter genotypes. As 
a result, the CCR2b'64l effect must be secondary to 
polymorphisms in the promoter, which can vary significantly 
among different ethnic groups. The association between the PI /PI 
variant and an accelerated course independent of CC/?2fe-64I (6), 

15 coupled with the preliminary demonstration of reduced promoter 
activity (5) in carriers of the 59029G variant that includes P*0101 
and P'*'0102, enhances the likelihood that the effects of promoter 
polymorphisms are independent of CCR2b'64l. 
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TAB! ,F 7 

D i strihntion of rr/?5 promoter gpnorypPQ in ^rhnir Afnranc ^r^rt 
ethnic ranragiang 



Promoter 
Genotypes 



HIV+ 
Blacks 



HIV- 
Blacks 



OR 

(P) 



Caucasoids Caucasoids 



HIV+ 



(n = 


213) 


(n=100) 






(n=75) 


3.3 (7) b 


0 


6.8 


(0.065) 


21.3 (16) 


6.6 


(14) 


9.0 (9) 






12.0 (9) 


4.7 


(10) 


6.0 (6) 






25.3 (19) 


5.6 


(12) 


5.0 (5) 






8.0 (6) 


12.2 


(26) 


10.0(10) 






0 


0.5 


(1) 


4.0 (4) 


0.1 


(0.038) 


?.3 (1) 


5.6 


(12) 


19.0(19) 


0.3 


(0.001) 


4.0 (3) 


15.0 


(32) 


10.0(10) 






^.0 (6) 


2.8 


(6) 


0 


5.8 


(0.097) 


0 


2.8 


(6) 


8.0 (8) 


0.3 


(0.042) 


2.7 (2) 


9.9 


(21) 


4.0 (4) 


2.6 


(0.054) 


1.3 (1) 


5.2 


(11) 


7.0 (7) 






6.7 (5) 


13.1 


(28) 


12.0(12) 






9.3 (7) 


12.2 


(26) 


6.0 (6) 


2.2 


(0.092) 


0 



HIV- 
(n=l 18) 



OR 

(P) 



P*0101/*0101 
P*0101/*0102 
P*0101/*0201 
P*0101/*0202 

P*0102/*0102 
P*0102/*0103 
P*01 027*0201 
P*0102/*0202 

P*0103/*0103 

P*0103/*0201 
P*0103/*0202 

P*020I/*0201 
P*0201/*0202 

P*0202/*0202 



3.1(0.010) 



7.6 (9) 
9.3 (11) 
28.8 (34) 
12.7 (15) 



1.7 (2) 
0 

3.4 (4) 

2.5 (3) 3.3(0.082) 

0 
0 

3.4 (4) 

15.3 (18) 0.4(0.073) 

14.4 (17) 

0.8 (1) 



genotype 
P*01 047*0202. 



individual in this group carried the rare 



Numbers are expressed as % (n). 



8NSDOCID <WO ait2857A2 I > 



57 



wo 01/12857 



PCT/US(K»/22255 



00 



c 

«3 



c 

c< 

c 

u 

o: 
u 

c 

c 

E 

U 

•c 
c 



C 



>- 

2 



c 

>- 



c 



c 



£ 

> 

CG 

tx) 

CJ 

c 



a 

E 
o 

U 



o 
c 

a. 



^ 00 



oo 

+ ^ 



0) 



V3 

c 

CO 
o 



+ <N 



CO 

u 



+ ^ 



o 



00 

en wo 



d o o 



o \o 



On , 



^ d 



CM 



oo 



O O 



ON ^ 



o;<=^ 

SOTT 



SO 
CM 

d 

m 

O o o 



NO 



O 

o ^ 

o d 



d 



5^ 

CN O CNl 



o 

CM* 



On 



m o • 
rsi w o 



CM 

vd<=> 



<N 

On o 
ON <N 

^ •? 

On 

o o 

?n q 
o^ 



ON 



to 



ON 



O ^ 



o 

§^ 

^ o 

On ^ 

000 

^ NO 

CO 

On 



2u 




o 
a 

ON 

n 
o 

ON 



+ 



+ 



8 



^ < 



cd 

=3 



CN 
O 
CM 

o 

O 

o 



o 

CN 
O 

o 

CN 
O 



o 

CN 

o 

♦ 



< 

On 
CN 
O 
On 



+ 



o 
o 

Cd 



CO 
O 



o 
a. 



o 



c 

> 

o 
> 



4> 
>% 

o 
c 
a> 

60 



< = 



cd 



o 

o 

ON 

E 
o 



4) 
> 

4> 



vo 

I 

CN 

U 
u 



T3 

c 
cd 



o 

* 

CN O 

2 s 

o 

o 



e 

o 

Cd 



Cd 
o 



+ .5 



^ ^ o 

o o 

s > s 

* 



0 
O 
cd 



73 
Cd 

d 



CN 
O 

o 
a. 



o 5? 



in 

U 
U 



^ 2 



•0 
c 



58 

BNSDOCID:<WO 0112e57A2 1 > 



wo 01/12857 



PCT/USOO/22255 



TARI.K g 

Difff.rantial — distrihiitinn — of rrffs prnmntftr gftnn»yp«>« in htv-I4. 

Rwandan wnmcn with varvinp rates of diseasft prog rp.s<;ion an^ a 
comparison — with prftvimisly rprngnizpH p rnmnfftr variantc anH th^ 
5 closely rfilatpri rrR7h ppnoryppg 

RPs» IPs» SPs» p for 

Observed genotypes (N =15) (N = 86) (N = 100) trend 
CCR5 promoter genotypes " defined here 

1. P*0101/*0201 0 2.3 (2) « 6.9 (7) 0.083 

2. P*0101/*0202 0 1.2 (1) 10.9 (11) 0.005 

1 + 2 0 3.5 (3) 17.8 (18) 0.001 

3. P*0101/*0102'' 13.3 (2) 5.8 (5) 5.0 (5) 0.318 
4. P*0101/*0103'':not found - - - - 

5. P*0101/*0101«« 6.7 (1) 1.2 (1) 4.0 (4) 
3 + 4 + 5 = 3 + 5" 20.0 (3) 7.0 (6) 9.0 (9) 

1 + 2 + 3 + 4 + 5= 20.0 (3) 10.5 (9) 27.0 (27) 0.030 
1+2 + 3 + 5" 

6. P*0102/*0202 26.7 (4) 18.6 (16) 11.0 (11) 0.058 

7. P*0201/*0201 13.3 (2) 7.0 (6) 3.0 (3) 0.072 

6 + 7 40.0 (6)25.6 (22) 14.0 (14) 0.007 

8. P*0202/*0202 6.7 (1) 14.0 (12) 13.0 (13) 



Promoter genotypes recognized previously 

59029G/G« 26.7 (4)20.9 (18) 26.0 (26) 

59029A/A' = Pl/Pl' 33.3 (5) 34.9 (30) 30.0 (30) 

59029G/A 40.0 (6)44.2 (38) 43.6 (44) 
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CCR2b genotypes 
CCR2b-64UV + 64I/I 53.3 (8) 50.0 (43) 49.0 (49) 

CCR2b-641/y 46.6 (7)43.0 (37) 41.0 (41) 

CCR2b-641/64l 6.7 (1) 7.0 (6) 8.0 (8) 

"The three groups of patients (see text) were classified according to 
their rates of disease progression: RPs=rapid progressors=deaths 
from AIDS in <6 years; SPs=slow progressors=no clinical 
manifestation of HlV-l-related diseases in >10 years; 
5 IPs=intermediate and indeterminant progressors with less extreme 
clinical outcomes. ''Fifteen major CCR5 promoter genotypes involving 
5 major alleles were defined in this cohort. Only those involving 
alleles that appeared to distribute disproportionally (p ^0.1) are 
shown. 'Number.'^ are expressed as % (n). ''All genotypes carry 
10 homozygous 59029G/G. -=590290/0 (5) are represented by 6 major 
genotypes in this cohort: P*0101/*0101, P*0101/*0102, 
P*0101/*0103, P*0102/*0102, P*0102/*0103, P*0103/*O103; none of 
them appeared to have biased distribution among the three groups of 
subjects. The PI genotype (6) consists of 2 distinct alleles (*0201 
15 and *0202) in this cohort as defined by the polymorphism at 
nucleotide position 59653 (GenBank U59626). 
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TABI.K 10 



Analyses of CnmhinationS of diseasp-related fanmr^ gpftrifirially 

recognized in HTV-1 seropositive Rwandan wam^n 



Promoter Presence or absence of 


RP& 


IPs 


SPs 


p for 


genotypes other recognized factors 


%(n) 


%(n) 


%(n) 


trend 


1 + 2 * + Hmz ^ Sit A, B, or both 


0 


0 


2.0 (2) 


0.193 


+ Htr " at A, B, or both 


0 


3.5 (3) 


16.0 (16) 0.002 




0 


0 


3.0 (3) 


0.109 


' B*57 


0 


3.5 (3) 


15.0 (15) 0.009 


- Hmz « - B*57 


0 


3.5 (3) 


13.0 (13) 0.010 


P*0201/*0201+Hmz " at A, B, or both 


6.7 (1) 


1.2 (1) 


0 


0.037 


+ Htr * at A, B, or both 


6.7 (1) 


5.8 (5) 


3.0 (3) 


0.327 


+ B*57 


0 


1.2 (1) 


0 


0.500 


- B*57 


13.3 


5.8 (5) 


3.0 (3) 


0.096 




(2) 








- Hmz » - B*57 


6.7 (1) 


4.7 (4) 


3.0 (3) 


0.378 


P*0102/*0202+Hmz " at A, B, or both 


6.7 (1) 


7.0 (6) 


3.0 (3) 


0.250 


+ Htr " at A, B, or both 


20.0 


11.6(10) 


8.0 (8) 


0.155 




(3) 








+ B*57 


0 


3.5 (3) 


2.0 (2) 


0.934 


- B*57 


26.7 


15.1 (13) 


9.0 (9) 


0.045 




(4) 








- Hmz • - B*57 


20.0 


10.5 (9) 


6.0 (6) 


0.070 




(3) 









• 1 + 2 = P*0101/*0201 and P*0101/*0202, both involving the 
5 59029G-carrying P*0101. " Hmz, homozygosity; Htr, heterozygosity. 
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Age, ethnicity and sexual and drug-using habits usually 
alter the risk of acquiring HIV-1 infection. For example, among 
heterosexual women in Kigali, the observed high risk factors 
included: 1) currently living alone but having multiple sexual 
5 partners or currently living with a single partner but both partners 
having extra-marital affairs (OR = 1.8, p = 0.024); 2) history of 
sexually transmitted disease in the past five years (OR = 1.5, p = 
0.022); 3) having multiple life-time sexual partners (OR ^ 1.7, p = 
0.004); and 4) no condom use during sexual intercourse in the last 2 

10 years (OR = 8.6, p = 0.0001). Risk factor 1 was strongly correlated 
with risk factor 2 among the Rwandan women (x^ = 20.3, p = 0.001). 
Accordingly, the more strongly related risk factor 2 was chosen for 
additional analyses. For injecting drug users in New York, high risk 
factors included: 1) having sex with IV drug users (OR = 2.6, p = 

15 0.039); 2) sharing needles in shooting galleries (OR = 1.3, p = 0.006); 
3) attending shooting galleries (OR = 1.5, p = 0.065); and 4) sharing 
needles with strangers (OR = 1.01, p = 0.052). 

Univariate analysis revealed that patterns of behavior 
(with regard to sex in Kigali and both sex and drug injection in New 

20 York) represented the strongest risk of getting infected. An ordinal 
scoring scheme summarizing these behavioral factors demonstrated a 
strong association with risk in the Kigali, but not in the New York 
cohort, suggesting that the latter group was strongly biased as a 
result of selective enrollment. Stratification of subjects by behavioral 

25 risk score revealed that CCR5 promoter allele P*0201 conferred more 
stable protection on subjects with the lowest risk score than on those 
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with higher scores in Kigali ( high risk OR = 0.4, 95% CI = 0.0-7.2; low 
risk OR = 0.4, 95% CI = 0.2-0.7). In contrast, individuals with 
intermediate to higher risk scores were better protected by P*0201 in 
New York (high risk OR = 0.7-0.9; low risk OR = 0.3-0.6) (Table 11). 
5 The risk associated with P*0101/*0101 genotype diminished in the 
New York cohort. 

To assess the simultaneous and independent genetic and 
non-genetic effects, variables for age, behavioral risk, ethnicity and 
the two contributing CC/?5 promoter genotypes (P*0101/P*0101 and 

10 P*0201) were incorporated into logistic regression analyses 
performed on separate data from Kigali and New York populations 
(Table 11). Age and risk score consistently served as independent 
determinants of HTV-l infection in both cohorts. In Kigali, CCR5 
promoter genotypes involving P*0201 were associated with decreased 

15 risk; in New York, the P*0101/*0101 genotype was associated with 
increased risk. Modest and inconsistent interaction between age and 
risk score and between ethnicity and CC/25 promoter variants did not 
contribute significantly to risk in the regression model. The two 
genotypes associated with contrasting outcomes of infection in the 

20 two cohorts are mutually exclusive of each other (Table 3), and their 
reciprocal effects are in agreement with apparent differences in their 
frequencies in the two populations (Figure 4). Thus, it became 
evident that associations of the two promoter genotypes 
P*0101/P*0101 and P*0201 with higher and lower risk of HIV-1 

25 infection, respectively, were independent of the effects of 
demographic and behavioral factors. 
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TABLE 11 

Univariate and multivariate regression analy^ fts <;hnwiiig the 

independent — and relative cnntrihntinn t o HTV-l inferttnn hy genetir 
and non-genetic factors in two rohorts 



i. i^ogisuc moaei lor univariate analyses oi tne 


New 


York data set 




Factors cniisid(>red 




OR (95% CI) 


A 


Having sex with IV drug user 




2.6 


(1.0-6.3) 


0.039 


Sharing needles in shooting galleries 




1.3 


(1.1-1.6) 


0.006 


Attending shooting galleries 




1.5 


(1.0-2.4) 


0.065 


Sharing needles with strangers 




1.0 


(1.0-1.0) 


0.052 


Age 




0.9 


(0.9-0.9) 


0.005 


Ethnicity 










Caucasian Americans 




0.6 


(0.3-1.1) 


0.1 10 


African Americans 




0.8 


(0.4-1.8) 


0.609 


Hispanics 




1.7 


(1.0-2.9) 


0.073 


CCR2b-64l 




0.9 


(0.5-1.7) 


0.840 


CCR5-D32 




1.7 


(0.5-5.3) 


0.390 


CCR5 promoter P*0101/*0101 




3.6 


(1.5-8.4) 


0.004 


CCR5 promoter P*0201 




0.5 


(0.3-0.9) 


0.020 


SDF]-yA 




1.5 


(0.8-2.8) 


0.173 


II. Multivariate analyses of the New York data set 








HIV-1 infection = 1.60 + 0.31 x (sharing needles) - 0.06 x age - 


0.83 X 


(being white) + 1.41 x (CCRS promoter P*0101/*0101) 




Sharing needles OR = 


1.4, CI = 


1.1-1.7, p 


= 0.003 


Age OR = 


0.9, CI = 


0.9-1.0, p 


= 0.014 


Being white OR = 


0.4, 


CI = 


0.2-0.9, p 


= 0.021 


CCRS P*0101/*0101 OR = 


4.1. 


CI 


= 1.7-10.2, 


P = 


0.002 








III. Logistic model for univariate analyses of the Kigali data set 




Factors r.onsiHftreH 




OR (95% CT) 




Venereal disease in past 5 years 




2.5 


(1.2-5.1) 


0.016 


Condom use 2 yrs prior to serocojwersiou 0.02 


(0.0-0.1) 


0.000 


Number of lifetime sexual partners 




1.5 


(1.09-2.0) 


0.011 


Age 




0.9 


(0.83-0.9) 


0.000 


CCR2b-64l 




1.6 


(0.91-2.6) 


0.108 
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CCR5 promoter P*0101/*0101 999.0 (0-0.900 

999.0) 

CCR5 promoter P*0201 0.4 (0.3^0.7) 0.001 

SDFl-yA 1.35 (0.65-2.81)0.417 

IV. Multivariate analyses of the Kigali data set 

HIV-1 infection = 7.36 + 1.18 x (venereal disease) - 4,54 x (condom 
use) - 0.09 X age - 0.87 x (all CCR5 promoter genotypes involving P*0201) 

Venereal disease in past 5 years OR=3.2,CI=1.4-7.7,p =0.007 

Condom use 2 yrs prior to seroconversion OR=0.02,CI=0.0-0,l,/> 

=0.000 

Age OR=0.9,CI=0.8-1.0,p =0.024 
CCR5 promoter P*0201 OR=0.4,CI=0.2-0.8,/7 =0.011 



KXAMPf.E 14 

5 The Ohscrvcd relationships between rr^5 prnmotftr pn lymorphigiTig 

and HIV-1 infection differed frnm the previoiiRly re gngnireH 

associations 

To compare previously reported relationships of CCR5 
promoter sequences to progression with their relationships to 

10 infection seen here, the CCR5 promoter alleles were further 
aggregated into groups as defined by earlier methods. The effects of 
the P*0202-linked CCR2b'64I variant were also tested. By this re- 
analysis, the 59029G/G variant previously associated with slower 
disease progression and the Pl/Pl genotype previously associated 

15 with faster disease progression separated mutually exclusive subsets 
of the promoter genotypes, explaining their opposing effects. 
However, the previously recognized relationships between these two 
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CCR5 promoter variants and HIV-1 disease progression almost 
contradicted their relationships to infection. For example, the 
59029G/G (= all combinations involving only P*0101, P*0102, P*0103, 
and P*0I04) was previously associated with slower disease 
5 progression (5). In the population cohorts described herein, 
P*0101/P*0101 (a subset of 59029G/G) showed unequivocal 
relationship to increased risk of infection (Table 8). Here, 
differences in population allele frequencies are unlikely to be a 
sufficient explaination. 

10 The effects ol the Pl/Pl genotype differed even between 

the two cohorts: in Kigali, the Pl/Pl genotype was associated with 
becoming infected, whereas in New York, the same genotype was 
associated with remaining uninfected. The effects of CCR2b-6Al were 
the least stable when the 4 ethnic groups were analyzed separately. 

15 These apparently self-contradictory results contrast starkly with the 
strength and consistency of the relationships established by the 
typing scheme used herein, which had a definitive resolution of the 
promoter genotypes (Table 3). Thus, the manisfestation of CCR 
relationships appeared to be sensitive to ethnicity and allele 

20 definitions. 
Summary 

Intensive effort has been devoted to measuring the 
influence of polymorphisms in CCR5 and its promoter on the rates of 
disease progression during HIV-1 infection. The almost complete 
25 absence of HIV-1 infection in CC/JJ-A32 homozygotes has highlighted 
the importance of this co-receptor for initial penetration by the 
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Virus. It is conceivable that reduced production of CC/?5 mediated by 
the promoter genotypes (5) can further determine the infectability 
by HIV-1. This aspect has heretofore escaped much of the attention 
surrounding the CCR5 promoter polymorphisms. 
5 Based on earlier data derived from various ethnic groups, 

the four disease-related promoter alleles P*0101, P*0102, P*0201 and 
P*0202 differ in their frequencies in different populations. In 
particular, the promoter allele P*0201 occurs more frequently in 
Caucasoids (Caucasians and Hispanics. 34%) than in populations of 
10 African origin (22%). If the P*0201/*0201 (a subset of Pl/Pl) truly 
mediates more rapid progression to disease following HIV-1 
infection, then a higher proportion of rapid progressors can be 
attributed to this genotype in Caucasians (11-12%) than Africans (4- 
5%). It would, therefore, be more difficult to detect the effect of 
15 P*0201/*0201 or Pi/Pl genotype in populations of African origin. 
The lack of association between Pl/Pl and rapid disease progression 
in African-Americans (6) reflects this difficulty, but a better 
resolution of the Pl/Pl genotype is likely to pinpoit the exact alleles 
involved. 

Despite the consistent associations between CCR5 
promoter variants and contrasting rates of HIV-1 disease progression, 
the mechanisms for these effects are still puzzling. For example, 
none of the polymorphic promoter sites map within any of the 6 
putative transcription factor-binding (TFB) elements (13,14), 
suggesting that other unknown neighboring factors in linkage 
disequilibrium are probably involved. 



20 
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In summary, the work described herein confirmed the 
association of CCR5 genotype Pl/Pl and further indicated that only 
the subset represented by the P*0201/*0201 genotype was the true 
factor in the Rwandan populatrion studied. Meanwhile, two 
5 promoter genotypes, P*0101/*0201 and P*0101/*0202, collectively 
and individually showed the opposite effects. The appearance and 
strength of the association of CCR2b'64I obviously depended on the 
relative frequencies of 2 competing CCR5 promoter genotypes, 
P*0102/*0202 and P*0201/*0202, both involving the CCR2b'64h 

10 carrying P*0202 allele. In addition, the disease-accelerating effect of 
P*0201 7*0201 is predicted to be more significant to Caucasian and 
Hispanic populations, whose P*0201 frequencies are higher than 
those of populations of African origin. 

The consistent effects associated with CCR5 promoter 

15 alleles in ethnically diverse populations studied herein are less 
dramatic compared with those associated with CCR5'A32 and CCR5- 
m303. It is unlikely that a simple mechanism can explain all of the 
observed relationships. The exisitng data pool already points to a 
complex phenomenon that depends on a combination of differences 

20 in promoter activities (5), in splicing of CCR5 transcription products 
(13), and in the relative prevalence of M- and T-tropic quasispecies 
at different stages of HIV-1 infection (40,55). There is also limited 
evidence that the co-receptor usage by HIV-1 may be governed not 
only by CCR5 expression (37,52), but also by the ratio between CCR5 

25 coreceptor and CD4* on the cell surface (19). If the relative 
expression of CCR5 and CD4* is altered after infection, the effects 
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exerted by a promoter genotype at the inception of HIV-1 infection 
could appear very different from those seen at later stages. 

There are three single nucletide changes between the 
promoter alleles P*0101 and P*0201, however none of them occur in 
5 any of the six putative transcription factor-binding elements. If the 
different effects associated with these two alleles have anything to do 
with differential transcription of CCR5, there are probably additional 
unrecognized elements in the promoter region. Meanwhile, two of 
the three nucleotide changes between CCR5 promoter alleles P*0101 

10 and P*0201 are located in the untranslated region (exon 1) of CCR5 
mRNA. These sites (59353 and 59402) may alter the translation 
rather than transcription. The relatively high frequency of promoter 
alleles P*0101 and P*0201 in different ethnic groups suggested that 
these factors can be more significant than the two Caucasian-specific 

15 factors in the CCR5 locus previously shown to mediate HIV-1 
transmission. 
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TABI.F. 12 

Alldf, assipnniRnt for CC.K?h rrR5 anrl cms p rnnint/>r allplf^g hac«.H 
upon PrR.SSP 





Genotypfts 


DNA band(s) 


Notes 


5 




visualized in 








SSP reaction 






CCR2b-64V 


la=413 


bp(++++) 


RHP applicable (Smith, 




1997) 






10 


CCR2b-64I 


Ih— 41 "\ 

X U— *T 1 O 




lb always has 4b=309 














CCR5 wild type 


1 D— JO J 


bp(+++) 


197 bp control product 




CCR5-A32 






165 bp control product 




P*0101 


2a=363 


bp(++++); 




15 




3a=412 


bp(++++); 


extra 5a=367 bp(++++) 








Dp(++++) 






2a=3o3 


bp(++++); 








3b=412 


bp(++++); 


extra 5a=367 bp(++++) 


20 




4a=309 


bp(++++) 




P*0103 


2a=363 


bp(+/-); 








3b=412 


bp(++++); 


extra 2e=363 bp(++++). 






4a=309 


bp(++++) 


5b=367 bp(++++) 


25 


P*0201 


2d=363 


bp(++++); 






3d=412 


bp(++++); 


MA 






4a=309 


bp(++++) 






P*0202 


2d=363 


bp(++++); 


4b(+++) usually has 


30 




3d=412 


bp(++++); 


lb=413 bp(++++) 






4b=309 


bp(+++) 
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The intensity of specific products is measured as: ++++ very strong; 
+++ strong; ++ clearly visible; + weak; +/- weak or invisible. See also 
Figure 2. 



5 RYAMPf g 1g 

CCR .' ^ fiftnotypfs — Determinfl — Progrftssion nf htv-i inf^pHnn hy 

Re^ulatinp Rarly Viral T narj 

Up to 306 mostly Caucasians homosexual HIV-1 
seroconverters from the Multicenter AIDS Cohort Studies and District 
10 of Columbia Gay Cohort had viral load (VL) measurements 6-36 
months after seroconversion. Men were typed by multiplexing FCR 
with sequence-specific primers that simultaneously defined markers 
such as: CCR2b-64l and cw-regulatory dimorphisms G303A, T627C, 
C630T. A676G. C927T (including the Pl/Pl pairing reported in 

15 association with rapid progression), and A32 in CCR5 (Figure 7). 
Viral load (Chiron or Roche) was reported as log.o. The GLM 
procedure in SAS was used to model the relationships. 

Age-, race-, cohort-adjusted mean VL at a median of about 
15 months was 4.33. Overall A32 carriers showed clearly lower \L 

20 (4.06, p=0.008), 641 showed only a modest reduction (VL=4.21) and 
Pl/Pl showed no reduction (VL=4.38). In a multivariate model, 
compared with the reference group (n=224), men who carried A32 in 
the absence of the homozygous A-C-C-A-C haplotype (a subset of 
Pl/Pl) had lower VL both in the presence (3.38, p=0.03) and the 

25 absence (4.01. p=0.001) of the 641-linked A-C-C-A-C haplotype. 
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Conversely, men who carried the homozygous A-C-C-A-C haplotype in 
the absence of A32 had higher VL (4.52, p=0.05). The overall 
adjusted model for association of CCR genotypes with VL was highly 
predictive (p=0.0005). 
5 CCR5 genotype HHE/HHE (Figure 9) contributed the 

majority of effect on VL and disease progression, causing >2-year 
acceleration to the onset of AIDS in 10% of the participants. The 
infrequent CCR5 genotype carrying A32 (HHG*2) and 641 (HHF*2) had 
a strong protective effect (lower VL and longer time to AIDS) in 2% of 

1 0 the participants. CCR2-641 alone conferred slight but not significant 
protection against HIV-1 VL or disease progression. 

In conclusion, comprehensive genotyping of CCR2b and 
CCR5 regulatory and coding region variants coupled with analysis 
accounting for linkage disequilibria between the variants indicated 

15 that A32, the 641-linked A-C-C-A-T CC^5 promoter haplotype, and the 
homozygous A-C-C-A-C genotype collectively determine later disease 
progression partly by governing viral load early in infection. 
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Any patents or publications mentioned in this 
specification are indicative of the levels of those skilled in the art to 
which the invention pertains. Further, these patents and publications 
are incorporated by reference herein to the same extent as if each 
5 individual publication was specifically and individually indicated to 
be incorporated by reference. 

One skilled in the art will appreciate readily that the 
present invention is well adapted to carry out the objects and obtain 
the ends and advantages mentioned, as well as those objects, ends 

10 and advantages inherent herein. The present examples, along with 
the methods, procedures, treatments, molecules, and specific 
compounds described herein are presently representative of 
pre V erred embodiments, are exemplary, and are not intended as 
limitations on the scope of the invention. Changes therein and other 

15 uses will occur to those skilled in the art which are encompassed 
within the spirit of the invention as defined by the scope of the 
claims. 
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WHAT IS CLAIMED IS: 

1 . A method of surveying CCR genotypes in a 
population, comprising the steps of: 

(a) obtaining biological samples from a representative 
5 number of individuals in a population, wherein each sample is from a 

different individual, wherein said sample comprises genomic DNA; 

(b) combining a portion of each sample with an 
experimental primer combination and a control primer combination » 
thereby producing primer-annealed DNA, wherein said experimental 

1 0 primer combination is selected from the group consisting of SEQ ID 
Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 it 10, 5 & 11, 5 & 12, 5 & 13, 
6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 15 & 16, wherein s.ud 
control primer combination is SEQ ID Nos. 17 & 18; 

(c) amplifying said primer-annealed DNA in a reaction, 
IS thereby producing amplicons, wherein reaction conditions for said 

amplification are optimized for sequence-specific amplification, 
wherein each experimental primer combination and said control 
primer combination are predicted to produce one or more amplicons 
having expected sizes in basepairs; 

20 (d) separating said amplicons by size, wherein the 

presence of: 

a 197 bp amplicon with said control primer combination 
is indicative of a CCRS wildtype coding sequence; 
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a 165 bp amplicon with said control primer combination 
is indicative of a CCR5-A32 coding sequence; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10. a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 1 6 
is indicative of a P*0101 CCR5 promoter allele; 

a 363 bp ampl'c-on with experimental primer combination 
SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 13 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 1 6 
is indicative of a P*0102 CCR5 promoter allele; 

^5 a 363 bp amplicon with experimental primer combination 

SEQ ID Nos. 5 & 7, a 363 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 9, a 367 bp amplicon with experimental 
primer combination SEQ ID Nos. 5 & 11, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 13 and a 309 bp 

20 amplicon with experimental primer combination SEQ ID Nos. 14 & 1 6 
is indicative of a P*0103 CCR5 promoter allele; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
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experimental primer combination SEQ ID Nos. 14 & 16 is indicative of 
a P*0201 CCR5 promoter allele; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental primer 
5 combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. IS & 16 is indicative of 
a P*0202 CCR5 promoter allele; 

(e) determining a CCR genotype for each sample based 
upon said CCR alleles indicated following step (d); and 

10 (f) compiling said genotypes determined in step (e), 

thereby genotyping said representative number of individuals in said 
population, thereby surveying CCR genotypes in said population. 

2. The method of claim 1, wherein said biological 
15 samples are selected from the group consisting of blood, serum, 

saliva, semen, tissue biopsy and DNA. 

3. The method of claim 1, wherein said separation is 
selected from the group consisting of gel electrophoresis and size 

20 fractionation. 

4. The method of claim 1, wherein said amplifying is 
by polymerase chain reaction (PCR). 
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5, A method of surveying HIV-1 co-receptor CCR alleles 
in an individual, comprising the steps of: 

(a) obtaining a biological sample from an individual, 
wherein said sample comprises genomic DNA; 

5 (b) combining a portion of said sample with at least one 

experimental primer combination and a control primer combination, 
thereby producing primer-annealed DNA, wherein said experimental 
primer combinations are selected from the group consisting of SEQ 
ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 
10 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 15 & 16, wherein said 
control primer combination is SEQ ID Nos. 17 & 18; 

(c) amplifying said primer-annealed DNA in a reaction, 
thereby producing amplicons, wherein reaction conditions for said 
amplification are optimized for sequence-specific amplification, 

15 wherein each experimental primer combination and said control 
primer combination are predicted to produce one or more amplicons 
having expected sizes in basepairs; and 

(d) separating said amplicons by size, wherein the 
presence of: 

20 a 197 bp amplicon with said control primer combination 

is indicative of a CCR5 wildtype coding sequence; 

a 165 bp amplicoD with said control primer combination 
is indicative of a CCR5-Zi32 coding sequence; 
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a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 1 6 
is indicative of a P*0101 CCR5 promoter allele; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10. a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 13 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 1 6 
is indicative of a P*0102 CCR5 promoter allele; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 5 & 7, a 363 bp amplicon with experimental primer 
15 copibinaUon SEQ ID Nos. 5 & 9, a 367 bp amplicon with experimental 
primer combination SEQ ID Nos. 5 & 11, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 13 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 1 6 
is indicative of a P*0103 CCR5 promoter allele; 

20 a 363 bp amplicon with experimental primer combination 

SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative of 
a P*0201 CCR5 promoter allele; 
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a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 15 & 16 is indicative of 
5 a P*0202 CCR5 promoter allele. 



6. The method of claim 5, wherein said biological 
samples are selected from the group consisting of blood, serum, 
saliva, semen, tissue biopsy ^nd DNA. 



10 



7. The method of claim 5, wherein said separation is 
selected from the group consisting of gel electrophoresis and size 
fractionation. 



8. The method of claim 5, wherein said amplifying is 
by polymerase chain reaction (PGR). 



9. A method of predicting the disease progression of 
AIDS in an HIV- 1 -infected individual, comprising the steps of: 

2^ (a) obtaining a biological sample from an individual, 

wherein said sample comprises genomic DNA; 

(b) combining a portion of said sample with at least one 
experimental primer combination and a control primer combination, 
thereby producing primer-annealed DNA, wherein said experimental 
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primer combinations are selected from the group consisting of SBQ 
ID Nos. 2 «& 3, 2 & 4, 5 & 7, 5 & 8. 5 & 9. 5 & 10. 5 & 11, 5 & 12, 5 & 
13, 6 & 7. 6 & 8. 6 & 12, 6 & 13, 14 & 16 and 15 & 16, wherein said 
control primer combination is SEQ ID Nos. 17 & 18; 

5 (c) amplifying said primer-annealed DNA in a reaction, 

thereby producing amplicons, wherein reaction conditions for said 
amplification are optimized for sequence-specific amplification, 
wherein each experimental primer combination and said control 
primer combination are predicted to produce one or more amplicons 
10 having expected sizes in basepairs; 

(d) separating said amplicons by size, wherein the 
presence of: 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental primer 
15 combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
experimental primer combination SEQ ID Nos. 14 & 16 is indicative of 
a CCR5 promoter genotype of P*0201/P*0201; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer 

20 combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 13, a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 8, a 
412 bp amplicon with experimental primer combination SEQ ID Nos. 
6 & 13, a 309 bp amplicon with experimental primer combination 

25 SEQ ID Nos. 14 & 16 and a 309 bp amplicon with experimental primer 



BNS00CID:<WO . 01128$7A2 I > 



82 



i 



wo 01/12857 PCT/USOO/22255 

combination SEQ ID Nos. 15 & 16 is indicative of a CCR5 promoter 
genotype of P*0102/P*0202; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer 
5 combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 12, a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 8, a 
412 bp amplicon with experimental primer combination SEQ ID Nos. 
6 & 13 and a 309 bp amplicon with experimental primer combination 
10 SEQ ID Nos. 14 & 16 is indicative of a CCR5 promoter genotype of 
P*0101/P*0201; 

a 363 bp aroplfcon with experimental primer combination 
SEQ ID Nos. 5 & 7, a bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 

15 experimental primer combination SEQ ED Nos. 5 & 12, a 363 bp 
amplicon with experimental primer combination SEQ ID Nos. 6 & 8, a 
412 bp amplicon with experimental primer combination SEQ ID Nos. 
6 & 13, a 309 bp amplicon with experimental primer combination 
SEQ ID Nos. 14 & 16 and a 309 bp amplicon with experimental primer 

20 combination SEQ ID Nos. 15 & 16 is indicative of a CCR5 promoter 
genotype of P*0101/P*0202; 

wherein a CCR5 promoter genotype of P*0201/P*0201 or 
P*0102/P*0202 is predictive of an accelerated rate of AIDS 
progression in said individual relative to an individual who does not 
25 possess said P*0201/P*0201 or P*0102/P*0202 gentoype, wherein a 
CCR5 promoter genotype of P*0101/P*0201 or P*0101/P*0202 is 
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predictive of a slower rate of AIDS progression in said individual 

relative to an individual who does not possess said P*0101/P*0201 or 
P*0101/P*0202 gentoype. 

10. The method of claim 9, wherein said biological 

samples are selected from the group consisting of blood, serum, 
saliva, semen, tissue biopsy and DNA. 



11. The method of claim 9, wherein said separation is 
10 selected from the group consisting of gel electrophoresis and size 
fractionation. 



12. The method of claim 9, wherein said amplifying is 
by polymerase chain reaction (PCR). 

15 13. A method of predicting the probability of HIV-1 

infection in an individual, comprising the steps of: 

(a) obtaining a biological sample from an individual, 
wherein said sample comprises genomic DNA; 

(b) combining a portion of said sample with at least one 
20 experimental primer combination and a control primer combination, 

thereby producing primer-annealed DNA, wherein said experimental 
primer combinations are selected from the group consisting of SEQ 
ID Nos. 2 it 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 5 & 11, 5 & 12, 5 & 
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13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 15 & 16, wherein said 
control primer combination is SEQ ID Nos. 17 & 18; 

(c) amplifying said primer-annealed DNA in a reaction, 
thereby producing amplicons, wherein reaction conditions for said 

5 amplification are optimized for sequence-specific amplification, 
wherein each experimental primer combination and said control 
primer combination are predicted to produce one or more amplicons 
having expected sizes in basepairs; and 

(d) separating said amplicons by size, wherein the 
10 presence of: 

a 363 bp with experimental primer combination SEQ ID 
Nos. 6 & 8, a 412 bp with experimental primer combination SEQ ID 
Nos. 6 & 13 and a 309 bp with experimental primer combination SEQ 
ED Nos. 14 & 16 is indicative of a CCR5 promoter genotype of 
15 P*0201/P*0201; 

a 363 bp with experimental primer combination SEQ ID 
Nos. 5 & 7, a 367 bp with experimental primer combination SEQ ID 
Nos. 5 & 10, a 412 bp with experimental primer combination SEQ ID 
Nos. 5 & 12 and a 309 bp with experimental primer combination SEQ 
20 ID Nos. 14 & 16 is indicative of a CCR5 promoter genotype of 
P*0101/P*0101; 

wherein a CCR5 promoter genotype of P*0201/P*0201 is 
predictive of a decreased probability of HIV-1 infection in said 
individual relative to an individual who does not possess said 
25 P*0201/P*0201 genotype, wherein a CCR5 promoter genotype of 
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P*0101/P*0101 is predictive of an increased probability of HIV-l 
infection lu said individual relative to an individual who does not 
possess said P*0101/P*0101 genotype. 



5 14. The method of claim 13, wherein said biological 

samples are selected from the group consisting of blood, serum, 
saliva, semen, tissue biopsy and DNA. 



15. The method of claim 13, wherein said separation is 
10 selected from the group consisting of gel electrophoresis and size 
fractionation. 



16. The method of claim 13, wherein said amplifying is 
by polymerase chain reaction (PCR). 

15 

17. A method of correlating CCR genotypes with HIV-1 
transmission and/or disease progression, comprising the steps of: 

(a) obtaining biological samples from a representative 
number of individuals, wherein each sample is from a different 

20 individual, wherein said sample comprises genomic DNA; 

(b) assessing each individual's HIV-1 status and/or risk 
of acquiring HIV-1; 
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(c) assigning each individual to a risk group, wherein 
said assignment is based upon said individual's HIV-1 status and/ or 
risk of acquiring HIV-1; 

(d) combining a portion of each sample with at least 
5 one experimental primer combination and a control primer 

combination, thereby producing primer-annealed DNA, wherein said 
experimental primer combinations are selected from the group 
consisting of SEQ ID Nos. 2 & 3, 2 & 4, 5 & 7, 5 & 8, 5 & 9, 5 & 10, 5 
& 11, 5 & 12, 5 & 13, 6 & 7, 6 & 8, 6 & 12, 6 & 13, 14 & 16 and 15 & 
10 16, wherein said control primer combination is SEQ ID Nos. 17 & 18; 

(e) amplifying said primer-annealed DNA in a reaction, 
thereby producing amplicons, wherein reaction conditions for said 
amplification are optimized for sequence-specific amplification, 
wherein each experimental primer combination and said control 

15 primer combination are predicted to produce one or more amplicons 
having expected sizes in basepairs; and 

(f) separating said amplicons by size, wherein the 
presence of: 

a 197 bp amplicon with said control primer combination 
20 is indicative of a CCR5 wildtype coding sequence; 

a 165 bp amplicon with said control primer combination 
is indicative of a CCR5-A32 coding sequence; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer 
25 combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
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experimental primer combination SEQ ID Nos. 5 & 12 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 1 6 
is indicative of a P*0101 CCR5 promoter allele; 

a 363 bp amplicon with experimental primer combination 
5 SEQ ID Nos. 5 & 7, a 367 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 10, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 13 and a 309 bp 
amplicon with experimental primer combination SEQ ID Nos. 14 & 1 6 
is indicative of a P*0102 CCR5 promoter allele; 

10 a 363 bp amplicon with experimental primer combination 

SEQ ID Nos. 5 & 7, a 363 bp amplicon with experimental primer 
combination SEQ ID Nos. 5 & 9, a 367 bp amplicon with experimental 
primer combination SEQ ID Nos. 5 & 11, a 412 bp amplicon with 
experimental primer combination SEQ ID Nos. 5 & 13 and a 309 bp 

IS amplicon with experimental primer combination SEQ ID Nos. 14 & 1 6 
is indicative of a P*0103 CCR5 promoter allele; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
20 experimental primer combination SEQ ID Nos. 14 & 16 is indicative of 
a P*0201 CCR5 promoter allele; 

a 363 bp amplicon with experimental primer combination 
SEQ ID Nos. 6 & 8, a 412 bp amplicon with experimental primer 
combination SEQ ID Nos. 6 & 13 and a 309 bp amplicon with 
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experimental primer combination SEQ ID Nos. 15 & 16 is indicative of 
a P*0202 CCR5 promoter allele; 

(g) determining a CCR genotype for each sample based 
upon said CCR alleles indicated following step (f); and 

5 (h) analyzing said genotypes determined in step (g), 

thereby genotyping said representative number of individuals with 
respect to said risk group assigned each individual, thereby 
correlating CCR genotypes with HIV-1 transmission and/or disease 
progression. 

10 18. The niethod of claim 17, wherein said biological 

samples are selected from the group consisting of blood, serum, 
saliva, semen, tissue biopsy and DNA. 

19. The method of claim 17, wherein said separation is 
15 selected from the group consisting of gel electrophoresis and size 

fractionation. 

20. The method of claim 17, wherein said amplifying is 
by polymerase chain reaction (PGR). 

20 

21. The method of claim 17, wherein said analyzing is 
by means selected from the group consisting of two-tailed Fisher's 
exact test, multiple logistic regression analysis, univariate analysis 
and multivariate analyses. 
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22. An oligonucleotide selected from the group 

consisting of SEQ ID Nos. 2-16. 
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SEQUENCE LISTING 

<110> Tang, Jianming 

Kaslow, Richard A. 

<120> Methods of Surveying For CC (Beta) Chemokine 

Receptor Variants and Their Association With HIV- 
1 Transmission cuid/or Disease Progression 

<130> D6217PCT 

<140> 

<141> 2000-08-11 

<150> 60/148,530 

<151> 1999-08-12 

<160> 35 

<210> 1 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 

<223> universal M13 primer 

<400> 1 
acaggaaaca gctatgac 18 

<210> 2 
<211> 18 
<212> DNA 

<213> Artificial Secjuence 

<220> 

<221> primer 

<223> CCR2b-5/lS, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 2 

tgggcaacat gctggtcg 18 

<210> 3 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

SEQl/11 
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<220> 

<221> primer 

<223> CCR2b-5/2S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 3 

tgggcaacat gctggtca 18 

<210> . 4 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR2b-3/lG, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
dovmstream promoter region 

<400> 4 

tggaaaataa gggccacaga c 21 

<210> 5 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-5/1S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 5 

gagtggagaa aaaggggg 18 

<210> 6 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 



SEQ2/11 
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<223> CCR5P-5/2S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 6 

gagtggagaa aaagggga 18 

<210> 7 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/1S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 7 

agaat agate tctggtctga aa 22 

<210> 8 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/2S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 8 

agaatagatc tctggtctga ag 22 

<210> 9 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/3S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 9 

SEQ3/11 
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gagaatagat ctctggtcta aaa 23 

<210> 10 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 

<223> CCR5P-3/4S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 10 

tagagaatag atctctggtc tg 22 

<210> 11 
<211> 22 
<212> Dm. 

<213> Artxticial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/5S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 11 

tagagaatag atctctggtc ta 22 

<210> 12 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-3/6S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 12 

agaatcagag aacagttctt cc 22 

<210> 13 
<211> 22 

SEQ4/11 
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<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCRBP-S/TS, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 13 

agaatcagag aacagttctt ct 22 

<210> 14 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5P-5/3S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 14 

caggaaaccc atagaagac 19 

<210> 15 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 

<223> CCR5P-5/4S, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 15 

caggaaaccc atagaagat 19 

<210> 16 

<211> 19 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

SEQ 5/1 1 
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<223> CCR5P~3/8G, primer used for typing major 
polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 16 

gtgggcacat attcagaag 19 

<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5-SP4G, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 17 

tcattacacc tgcagctctc 20 

<210> 18 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> primer 

<223> CCR5-PM6G, primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<400> 18 

tggtgaagat aagcctcac 19 

<210> 19 

<211> 5 

<212> DNA 

<213> Unknown 
<220> 

<221> gene 

<223> CCR5 promoter allele P*0101 

<400> 19 

gtcgc 

SEQ6/11 
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<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 

gtcac 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 

gttac 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 

accac 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 



PCT/USOO/22255 

20 
5 

DNA 

Unknown 
gene 

CCR5 promoter allele P*0102 
20 



21 
5 

DNA 

Unknown 
gene 

CCR5 promoter allele P*0103 
21 



22 
5 

DNA 

Unknown 
gene 

CCR5 promoter allele P*0201 
22 



23 
5 

DNA 

Unknown 
gene 

CCR5 promoter allele P*0202 
23 

SEQ 7/1 1 
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accat 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 
gtcggc 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 
gtcagc 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 
gttagc 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 



24 
6 

DNA 

Unknovm 
gene 

CCR5 promoter allele P*0101 
24 



25 
6 

im 

Unknown 
gene 

CCR5 promoter allele P*0102 
25 



26 
6 

DNk 

Unknown 
gene 

CCR5 promoter allele P*0103 
26 



27 
6 

Unknown 
gene 

SEQ8/11 
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<223> CCR5 promoter allele P*0104 

<400> 27 



gtcaac 



<210> 28 

<211> 6 

<212> DNA 

<213> Unknown 
<220> 

<221> gene 

<223> CCR5 promoter allele P*0201 

<400> 28 
accagc 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 
accagt 



29 
6 

DNA 

Unknovm 
gene 

CCR5 promoter allele P*0202 
29 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 



30 
24 

Artificial Sequence 



30 



gatagattat atctggagtg aaga 24 

<210> 31 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



SEQ9/11 
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<220> 
<221> 
<223> 

<400> 31 
atagattata tctggagtga agg 23 

<210> 32 

<211> 22 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<400> 32 
aaagtctggt ctctagataa ga 22 

<210> 33 

<211> 22 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<400> 33 
gaagtctggt ctctagataa ga 22 

<210> 34 

<211> 25 

<212> DNA 

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<400> 34 
ttaaaactct ttagacaaca ggttt 25 

<210> 35 
<211> 24 

SEQ 10/11 
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<212> DNA 

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<400> 35 
taaaactctt tagacaacag gttg 24 
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sequence: listimg 

<110> Ta«igx Jianming 

Kaslovr, Ricliard h. 

<120> Methods of Surveying For CC {Beta) Chemokine 
Receptor Variants and Their Association With KIV- 
1 Transmission and/ or Disease Frogreseion 

<130> D6217PCT 

<140> 

<141> 2000-08-11 

<1S0> 60/14&,S30 

<151> 1999-06-12 

<160> 35 

<210> 1 

<211> IB 

<212> DNA 

<213> Artificial SaQuenee 
<220> 

<22X> primer 

<223> universal M13 primer 

<400> 1 
acojgigaaaca gctatgac 19 ' 

<210> 2 
<211> 18 
<212> UNA 

<213> Artificial Sequence 
<220> 

<221> pritner 

<223> OCR2to-5/lSr prittier ueed for typing major 
polyniotphijsin in CCR2b, CCR5 and the CCR5 
downstream promoter region 

<iO0> 2 

tg^caacat gctggtcg 18 

<210> 3 
<211> 18 
<212> DMA 

<2I3> Artificial Sfequence 

SBQIAl 
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WO01/12SS7 



<220> 

<221> pr iJi^x 

<223> CCR2b-5/2Sr primer used for tipping raajor 
pQl-iTHOrphism in CCR2b, CCR5 and the CCR5 
downs tr earn promoter region 

<400> 3 
tgggcaacat gctggtca 18 

<210> . d 
<211> 21 
<212> Wh 

<213> Artificial SequM« 

<220> 

<221> primer 

<223> CCR2b-3/l<3, prinier u:aed for typing major 
pclymaTphism in CCR2b, CCR5 acid th$ CCR5 
downs treaia promoter region 

<400> 4 

tggaaaataa gggccacaga c 21 

<21Q> B 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> primer 

<223> OCR5P-5/iS. primer used for typing major 
polymorpbissti ift CCR2br OCRS and the CCrS 
downstream pramoter region 

<400> 5 

^agtggagaa aaaggggg 16 



<210> 6 

<2ii> la 

<212> IMOA 

<213> Artificial s^quenc^e 
<220> 

<221> primer 



SEQ2/11 
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<223> CCR5P-5/2S, priroer used for typing inajor 

polymorphism in CCR2b, CCRS and the CCR5 
downstream promoter region 

<400> 6 

gagLggagaa aaagggga 3.8 



<210> 


7 


<211> 


22 


<212> 


DNA 


■<213> 


Artificial 


'<;220> 




<221> 


primer 


<223> 


CCR5P-3/1S 



<400> 



primer used for typing major 
polymorphism in C:CR2b, CCR5 and the CCR5 
daumatreain promotes; i^egion 



agaatogatc tctggtctga aa 22 



<210> B 
<211> 22 
<212> EMA 

<2li> Aartdficial Sequmce 
<220> 

<221> primer 

<223> cCR5P-3/2Sr pciitter uaed for typing rrajor 
polymorphiam in CCR2b, CCRS and the CCR5 
downstneam prtmoter region 

<400> 8 

agaatagato tctggtctga ag 22 



<210> 9 
<211> 23 
<212> DKA 

<213> Artificial Sequence 

<220> 

<22l> primer 

<223> CCR5P-3/3Sr primer used for typing major 

polymorphism in CCR2b, CCR5 and the CCRS 
downstream proftioter region 

<400> 9 



SEQ3/I1 
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gagaatagat cectggteta aaa 23 



<21Q> 


10 


<2ll> 


22 


<212> 




<213> 


Az-tificial 


<220> 




<221> 


primer 


«223> 


OCR5P-3/4S 



<400> 



primer used foe 
polymorphism in 0CR2b, CCR5 
downs tr«aan promoter region 

10 



typing 
and the 



ma] or 
CCR5 



tiagagaatag atctctggtc tg 22 



<210> 11 
<211> 22 
<212> DKk 

<:213> Artificial iSdquence 
<Z2<i> 

<221> priiner 

<223> CCR5P-3/5Sj priraex used for typing major 

polymprphism in CCR2b, CCR5 aiid the OCRS 
doimstream promoter region 

<400> 11 

tagagaatag atctctggtc ta 22 

<210> 12 
<211> 22 
<212> DMA 

<213> Attificia.1 Sequence 

<2iO> 

•s221> pxiiuer 

<323> CCRSP~3/6S, primer " Lised for typing major 

polyroorpMsitt in CCR2b^ OCRS an^ the CCRS 
downstream promoter region 

<40t)> 13 

agaatcsigag aacagttctt cc 22 



<:210> 13 
<211> 22 



SEQ4/11 
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<212> DNjV 

<213> AttificiAl Sequence 

<220> 

<321> primer 

<2£3> CCR5P-3/7S, primer used for typing major 
poljmorphiBin in CCR2b, CCRS and the CCR5 
downstream proraotsr region 

<400> 13 

agaatcagag aaoagttett <^t 22 

<21D> U 
<211> 19 
<212> DMA 

<213> ArtlftciAl se<5uence 

<220> 

<22i> primer 

<223> CCR5P~5/3Sr primer used for typing major 
poli-rtiorphlam in CCR2bj CCRS and the CCR5 
downs treauEi promoter region 

<400> Id 

caggaaaccQ atagaagac 19 

<210> 15 
<211> 19 
<212> IMA 

<213> ArtifieiaL Sequence 
<220> 

<221> primer 

<223> CCR5P-5/4S, primer used for typing raajor 

p<;>IysnDrphism in CCR2br CCRS and the CCRS 
downs treeun promoter region 

<4D(» 15 

caggaaaccc atagaagat 19 

<210> 16 

<211> 19 

<212> DMA 

<213s» Artificial Sequence 
<220> 

<221> primer 

SEQ5/M 



0112857A2TI > 
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<223> CCR5P-2/8G. primer used for typing major 

polyriiorphisni ±n CCR2b. CCR5 and tJie CCR5 
dovflistream proraotLBT region 

<40D> 16 

gtgggcacat attcagaag IS 



<210> 


17 


<2Lli^ 


20 


<212> 




<213s. 


Artificial 


<320> 




<221> 




<223> 


CCR5-SP4fi. 



<400> 

tcatt;acacc tgcagccctc 



primer u&ed fgf 
polytiiorphiero in CCR2b, GCR5 
dovjnstream promoter region 

17 



typing znajor 
and ttie CCJtS 



20 



<2io> le 

<211> 19 
<212> im 

<213> Artificial Sequence 
<220> 

<221> primer 

<223> CCR5-WlSGf priitver uged for typing major 

polytnc>£phism in CCH2b, CCR5 and the CCR5 
downstream promoter region 

<400> 16 

tgst?A&9At: aagcctcac t9 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 



19 
5 

DNA 

Unknown 
gene 

OCRS prorooter allele P^OlOl 
19 



gtcgc 



SEQfi/ll 
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wo iii/ms7 

<:2ia> 
<211> 
<212> 
<213> 
<22Q> 
<22l> 
<223> 
<400> 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<223> 
<400> 

gttac 

<:210> 
<211> 
<;212> 
<213> 
<22D> 
<231> 
<223> 
<lDCi> 

accac 

<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<Z23> 
<400> 



^ PCTrt)S00«i255 

20 
5 

Unknown 
gena 

CCR5 promoter allele P*0102 

20 



21 
5 

Unknown 
gens 

promoter allele P*0l03 

21 



22 
S 

mm 

Un^cnoMn 
gene 

CCR5 promoter allele P*0201 
22 



23 
5 

xm 

Tlhloiown 
gene 

CCR5 promoter allele P*0202 
23 

SEQ7/11 
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accat 



<210> 24 

<aia> 6 

<212> DNA 

«2L3> Unknown 
<220> 

<221> gene 

<2.23> CCR5 pxoniotex allele P*0101 

<dOO> 24 
gtcggc 



<210> 25 

<211> 6 

<212> UNIi 

<213> Unknown 
<220> 

<221> gene 

<223> CCR5 prciiioter allele P*D102 

<dQO> 25 
gtcagc 



<210> 26 

<211> 6 

<212> DNA 

<213> Utiknown 
<220> 

<32l> gwe 

<223> CCR5 promoter allele P'^0103 

<4O0> 26 
gttagc 



<210> 27 

<211> 6 

<212> DNA 

<213> Unknown 
<220> 

<221> 9anA 

SEQ 



BNSDOCID:<WO 0U2e57A2TJ > 
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<223> CCR5 prcauoter allele p*010d 

<400> 27 



<210> 28 

<211> 6 

<212> DMA 

<213> CMknown 
<220> 

<221> §ene 

<225> CCR5 promoter allele P*0201 

<400> ae 



<210> 2$ 

<211> 6 

<212> r>}Hh 

<213> XMaianra. 
<220> 

<221> gene 

<223> CCR5 prciraater allele P*0202 

<40D> 25 
accagt. 



<210> 30 

<211> 2i 

<212> DMA 

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<400>- 30 
gatagattat atctggagtg 4aga 24 



<210> 
<211> 

<ai2> 

<213> 



31 
23 
DHA 

Artificial Sequence 
SBQWIl 
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pcmismuziiss 



<22Q> 
<221> 
<223> 

<400> 3X 
ata.gattata tctggagtga agg 22 

<210> 32 

<21X> 22 

<212> DNft. 

<213> Artificial sega^nce 

<220> 

<22i> 

<223> 

<400> 32 
^^tctgrgt ctctagataa ga 22 

<210> 33 

<211> 22 

<212> EWA 

<213> Artificial Sequence 

<220> 

<221> 

<223> 

<40Q> 33 
gaagtctggt ctctagataa ga 22 

<21Q> 34 

<21L> 25 

<212> DNft 

<213> Artificial Sequence 

<220> 

<221> 

<a23> 

<40D> 34 
ttaaaactct ttagacaaca ggttt 35 

<21D> 35 
<2U> 24 

SBQ l(Vli 
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ic212> 
<213> 
<220> 
<221> 
<223> 
<dO0> 



Artificial Sequence 



35 



taaaactctt tagacaac&g gttg 24 
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